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Psychological Bulletin 


INFANT CARE AND PERSONALITY! 


HAROLD ORLANSKY 
Department of Anthropology, Yale University 


INTRODUCTION 


The importance of childhood experience to the formation of per- 
sonality was one of the great findings of psychoanalysis. No social 
scientist will dispute that general finding, but all will wish to elucidate 
it. 

The orthodox school of psychoanalysis, in keeping with Freud's 
genetic and biological orientation, has stressed the special significance 
of infantile suckling, excretory, and genital functions for the shaping 
of adult personality. Indeed, the Freudian oral-anal-genital character- 
ology rests upon the assumption that specific channeling of these in- 
fantile physiological urges by varied parental care elicits specific psycho- 
logical constellations in the individual.? The essential features of this 
characterology have been borrowed by some psychologists (116, pp. 
362-385). Reasoning from Freudian theory, pediatricians and psy- 
chiatrists have advocated systems of infant care which they believe 
will promote the growth of secure and unneurotic personalities. In 
general, these systems advise prolonged breast feeding on self-demand 
by the infant, gradual and late weaning, and the late and lenient 


1 The writer wishes to thank Drs. Louise Ames, Wendell Bennett, Ralph Linton, and 
i i iticize this paper in manuscript 
for having been kind enough to read and criticize this pap pt. 
ihe wer ability = energy to utilize all their advice, it would be a better paper. 

2 For statements of this characterology, see Freud (43; 44; 46, pp. 135-140), Abra- 
ham (1, pp. 248-279, 370-417), Glover (54), Jones (77), Lorand (96), and Fenichel (36, 
pp. 62-69, 278-284, 487-496). 

3 A very extensive bibliograph 
succeeding issue of such journals as Psych 
chiatry, Frank (38, 39, 40), Ribble (128), 
ardent advocates of this program. 


y could be cited here which would be swollen by each 
iatry and the American Journal of Orthopsy- 
and Moloney (107, 108, 109) are among the 
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induction of bowel and bladder training. From the same theorekinal 
base, some anthropologists and psychoanalysts have, be separ ai 
attempted to explain presumed character differences between rms os 
of different cultures in terms of their different patterns of infant dis- 
E cR theory, however, arose from clinical investigations of 
alid emm, and the reconstructions of infant experience obtained 
from the analytic couch do not constitute empirical findings on the 
infantile situation. The work of child analysts like Melanie Klein and 
Anna Freud has added something to our direct knowledge of child 
psychology, but they have generally been more interested in applying 
psychoanalytic theory than in testing it. After a comprehensive review 
of the evidence, it was thus possible for three psychologists to say that 
“although we have now been exposed for some time to psychoanalytic 
and other psychiatric hypotheses regarding the effects of birth trauma, 
weaning trauma, extreme emphasis on early control of urin 
defecation . . . we have almost no objective records of the development 
of children going through these experiences, or of experiments controll- 
ing certain aspects of the problem” (113, p. S75); Ten years later, 
Murphy opined that “some aspects" of these hypotheses had been 
experimentally confirmed, but ‘the observations are meager and the 
theory overpretentious" (114, p. 485). 
Sears (136) has broadly surveyed objective studies designed to test 
psychoanalytic theory, but no one has yet concentrated atte 
the subject of infant disciplines and compiled the factual evidence on 
the relation between these disciplines and the development of personal- 
ity. The present paper proposes to do this, gathering data from two 
Sources: (a) empirical, experimental, 
gations in Western culture, and (b) ev 
logical inquiries into other cultures. 
to the adequacy of much of the da 


ation and 


ntion upon 


Students as Kardiner (79, 80) and Du Bois (26 


, 27) to ex 
of social and economic instituti 


ons and broad cultural pa 
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INFANT CARE AND PERSONALITY 3 
FEATURES OF INFANT CARE 


Nursing Experiences 


It has been argued that many features of the infant's early feeding 
experiences contribute to the formation of personality. Among the 
factors to which special psychological significance has been attributed 
are breast vs. artificial feeding, length of breast feeding, self-demand 
vs. scheduled feeding, and weaning. Studies which have attempted to 
test the significance of these factors are summarized below, followed by 
a brief evaluation of their validity. 

Breast vs. bottle feeding. The chairman of the Committee on Mater- 
nal and Child Feeding of the National Research Council concluded a 
recent investigation of the advisability of breast vs. artificial feeding 
with the following statement: 


Although there is a voluminous literature on the subject of the emotional 
value of breast feeding to both mother and baby, it is regrettable that concrete 
evidence on this point is difficult to obtain. This is particularly regrettable in 
view of the rather emotional treatment the subject is receiving in both pro- 
fessional and lay circles. The necessary evidence will have to come from long 
observations on the development of the personalities of both breast-fed and 
artificially fed infants (3, p. 915). 


While really adequate evidence is, to be sure, unavailable, the following 
studies have come closest to approximating the required observations. 
Rogerson and Rogerson (131) traced 109 children who had been 
observed as infants in their London clinic seven years before. Those 
artificially fed during infancy had experienced more feeding difficulties, 
poorer health, and greater restlessness during infancy than had those 
breast fed during infancy. At the age of 7, the group which had been 
artificially fed in infancy was found to have a greater proportion of its 
members manifest poor appetite, poor sleep, enuresis, fears, nervousness, 
and a poorer showing in school than the breast-fed group. The authors 
conclude that ‘‘there is evidence strongly suggesting that there is a 
positive association between feeding difficulties in infancy and subse- 
quent psychological difficulties at elementary school age" (p. 1181). 

In a study at Joliet, Ill., Hoefer and Hardy (67) subjected 383 
children 7-13 years old to physical and mental tests, correlating their 
ratings on the tests with the length of breast feeding in infancy. These 
findings will be reviewed below, but at this point we are interested in 
their observation that "Children who were artificially fed were, on the 
whole, inferior physically and mentally to the breast fed" (67, p. 619). 
The artificially fed children were more poorly nourished, more suscep- 
tible to diseases, and slower in learning to talk and to walk than were 


the breast-fed children. 
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Both the Rogerson and Rogerson and the Hoefer em 
were based on general populations, with no effort mua dica et 
he groups studied for factors other than methoc o eec ing in 
"X vei 'The physical and psychological difficulties which they found 
ee. tiftcially fed children can be accepted as descriptively accurate 
is dn 4 a ulations; but the inference that artificial feeding was a 
M idle "sl [soe in the creation of the difficulties is not warranted by 
a puta Indeed, Rogerson and Rogerson expressly disavow the 
pieces of inferring such a causal relationship (131, p. 1173). That 
ill-health and feeding difficulties in infancy can often be taken as evi- 
dence of emotional maladjustment is, of course, widely recognized. 

Our inference that these difficulties were not necessarily due to the 
manner of infant feeding is supported by a study conducted by Faber 
and Sutton (34) which comes closer to the scientific ideal of controlled 
experiment than do the studies outlined above. These authors compared 
42 babies at the Stanford University clinic who had been breast-fed less 
than six weeks and were thereafter reared on bottled milk w 
equal number of babies who had been breast-fed for an aver 
forty weeks. In other respects than the type of milk, the diets of both 
groups were alike; 26 members of each group were paired for weight at 
birth, and all babies received "excellent care." It was found that the 
breast-fed babies gained more weight than the bottle-fed babies during 
the first quarter year, but this relationship was reversed during the 
rest of the year. The bottle-fed group had better health and fewer 
diseases than the breast-fed group. Accepting the psychosomatic cri- 
terion that physical and psychological health are apt to be correlated, 


this study would seem to show that artificial feeding need be in no way 
inferior to breast feeding in promoting the emotional well-being of the 
infant. 


ith an 
age of 


r society without cons 
parents and the 


dition of the mother. The norm for 
lower-class families i i 


Babies between birth and abo 


p ut five months of a. 
++. develop better in our nurser 


ge, when not breast-fed, 


y than in the average proletarian household. 


ë See Ericson (29), Rogerson and Rogerson (131, P- 1166), and Davis and Havig- 
hurst (18). 
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Their gain in weight is more regular and intestinal disturbances are less fre- 
quent. ... In times of illness, the absence of tension and anxiety which the 
young mother invariably feels is certainly of advantage to the child.® 


Above all, the character of the mother must be evaluated before an in- 
telligent opinion can be ventured as to whether breast or artificial feed- 
ing will better serve to promote the healthy development of the child’s 
personality: 

In some cases, even though supplementary feeding may be necessary, breast 
feeding can advantageously be continued to foster the mother-child relation- 
ship. In other cases, it may be advisable, in order to minimize maternal rejec- 
tion, to reduce breast feedings, despite the physical possibility of continuing 
them (48, p. 235). 


We have seen no evidence to favor the current belief among some 
pediatricians that breast feeding is inevitably more advantageous to 
the child than artificial feeding.” 

Length of breast feeding. In the study described above, Hoefer and 
Hardy found that the highest ratings in muscle tone, physical and 
mental development, and early age at walking and dentition were reg- 
istered by children who, as infants, had been breast-fed 4-9 months; 
children breast-fed 3 months or less and 10 months or more had lower 
ratings than did this medial group (67). 

Somewhat comparable results were obtained by Pearson in a study 
of 72 problem children at the Philadelphia Child Guidance Clinic. 
Children breast-fed less than 6 months or more than 10 months were 
more maladjusted at school and displayed timidity and nervousness 
with greater frequency than did those children who had been breast-fed 
from 6-10 months (117, p. 289). 

Childers and Hamil (15) examined the records of 469 problem chil- 
dren at the Harper Hospital clinic in Detroit, and correlated the fre- 


5 Freud and Burlingham (42, p. 12). It is only fair to report that the authors add: 
“Breast-fed babies are, of course, better off than bottle-fed babies wherever they are. 
Our best results are found in babies who are breast-fed by their own mothers in our 
home" (p. 13). There are obvious reasons why a nursery baby who receives individual 
care from its mother should progress better than one who shares with many other babies 
the services of a professional nurse; but the assertion that breast-fed babies are invari- 

bottle-fed babies remains unproven. 
"i lari erit e is so frequently considered de rigeur, it may be well to note that 
a contrary opinion has been expressed by two outstanding pediatricians: [Some people] 
... deplore the emotional losses which the bottle-fed baby suffers. It is ieri m 
the nervous system of the modern infant is adaptive enough to meet most of the deman : 
made by new modes of nutrition. Indeed, oncoming generations of infants by virtue o 
these very changes may even acquire emotional patterns of self-dependence which will 
ultimately add to the mental stamina of the race and form personalities better suited to 
the stresses of civilization” (Gesell and Ilg, 52, p. 42). 
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quency of occurrence of 95 “personality traits" with the nr 
breast feeding in infancy. Children breast-fed 1-5 months os 
higher percentage of ‘‘abnormal behavior manifestations” than c hi c nu 
breast-fed less than 1 month or more than 5 months. However, this 
study is quite unsatisfactory methodologically, and its results can be 
given little credence. - "m — 

Maslow and Szilagyi-Kessler (102) gave a security-insecurity inven- 
tory to students in classes at Brooklyin College and correlated this 
measure of security with the length of infantile breast feeding. “The 
highest security scores were obtained by the groups w 
fed little or not at all and by those who were fed 
a year" (102, p. 84). 

Peterson and Spano (118) correlated the length of bre 
with personality ratings of nursery school children. 
is no evidence that those children breast fed least an 
and insecure than those breast fed longer”; and, again, "in the group of 
normal children studied . . . there is no relation between duration of 
breast feeding and child personality at the nursery school level, nor 
with adjustment at the pre-adolescent stage" (118, pp. 63, 65). 

All five investigations, it should be noted, agree that there is no 
linear correlation to be observed between length of breast feeding and 
any major aspect of personality studied. 
conclusion that present eviden 
Hoefer and Hardy, and Pe 
medial duration of breast f 
"normal" personality 
or prolonged period o 
tradicted by the Masl 
as to the validity 
that extreme cautio 


ho were breast- 
at the breast for over 


ast feeding 
They found “There 
e more maladjusted 


ata 
eeding is more apt to be associated with 
her a brief 
flatly con- 
arring questions 
oyed, it is clear 
alization applying 
€ and unrepresentative 
ies were based. This is a 

The danger « e illustrated by Levy’s 
statement that avoring rejection of the child 


we 
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long-fed group were “‘over-protected’ by their mothers. There is no 
reason to dispute the accuracy of this finding or the reasonableness of 
Levy's generalization. But Peterson and Spano tested the generaliza- 
tion in their study of nursery school children, and concluded that 
"duration of breast feeding is not related to maternal rejection and 
cannot therefore be taken as an index of rejection" (118, p. 65). There 
may, of course, be some error in their procedure, although this is not 
apparent from a reading of the report. But is it just as likely that the 
groups studied differed in certain essential cultural characteristics, and 
that the contrasting generalizations advanced by Levy and Peterson 
are each valid only for populations which resemble the original groups 
in these (unexplored) characteristics. 

Self-demand vs. scheduled feeding. The prevalent opinion in many 
circles is that self-demand feeding in which nursing always rewards the 
child's crying is better calculated to produce a confident and unneurotic 
personality than is rigidly scheduled feeding in which the clock rather 
than the infant sets the nursing time. The following quotations illustrate 
this viewpoint: 

To the white child, whose feeding and other routines are rigidly scheduled, 
the mother or nurse . . . must appear incalculable. . . . He must develop a feel- 
ing that each individual is alone in life. 


To the Navaho baby, on the other hand, other persons must appear warmer 
and more dependable, for every time he cries, something is done for him (85, 


p. 31; 82, p. 57). 


The parent who responds to a clock rather than to the behavior of the child 
is, from the child's point of view, not responsive at all. . . . If the rewards which 
a child receives bear no consistent relation to his behavior . . . an apathetic or 
an anxious or a hostile individual is likely to result (110, p. 90). 


Such statements must be regarded as hypotheses not facts; and, 
so far as the writer has been able to gather from a rather thorough search 
of the literature, except for two studies to be noted forthwith, they have 
never been subjected to any empirical test. A great deal of reading-in 
of adult emotion into the infant situation is evident in these interpreta- 
tions. It is well known that only a few years ago the best "scientific" 
opinion favored scheduled feeding. to promote healthy personality 
development in the child, and this view is still fairly widespread in the 
medical profession. Certainly, even very young infants learn to adjust 
their sleeping and eating habits to a schedule of regular feeding, and 
it would seem that this regularity might just as much serve to promote 
a feeling of security and confidence on the part of the child as any other 
feeling. Here, too, we lean to the belief that a specific discipline does 
not exert a specific, invariant psychological influence upon the child, 
and that its effect can be gauged only from study of the parental atti- 
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tudes associated with its administration, of the child's constitutional 
endowment, and of the entire social-historical situation in which the 
ent personality is located. 

ehe ens studies to which we have referred have not been fully re- 
orted, but the published account is such a choice specimen not only 

of difficulties of the particular problem with which we are concerned but 

of social science methodology in general, that we reproduce it nearly 

verbatim: 


Miss Trainham . . . reported on a study begun at the Merill-Palmer School. 
Two groups of children were chosen, one group on self-regulating feeding 
regimes, and the other not self-regulated. The parents of both groups were 
quite similar. ... One group consisted of ten subjects and the other twelve. . . 
the results suggest in a very striking way that the self-regulated children are in 
all categories of the tests more advanced in their development. . . , 


Dr. Escalona had conducted similar studies . . . and had arrived at opposite 


ps. One group consisted of 
€ other group consisted of 


a theory to “explain” a par- 
ain all the relev 


ocial scientists today is a minimum 


n groups that they study 
and the social scientists— 
d to them in the artificial 


the history of 
aboratory, 
tory of the 


unrepresentative groups, can o 


nly lead to such 
one described above. 
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Weaning. The writer has been unable to find any experimental data 
correlating different forms and times of weaning with aspects of per- 
sonality, but a brief discussion of the question may nevertheless be in 
order. As an event which, in Western society, usually occurs within a 
relatively short space of time, weaning can be classified as a trauma. 
However, in keeping with his tendency to universalize Viennese cultural 
norms, Freud did not see the culturally relative nature of weaning, and 
conceived of it, instead, as an invariably painful situation for the child: 

It looks . . . as if the desire of the child for its first form of nourishment is 
altogether insatiable, and as if it never got over the pain of losing the mother's 


breast. I should not beat all surprised if an analysis of a member of a primitive 
race, who must have sucked the mother's breast when he could already run and 


talk, brought the same complaint to light (46, p. 166). 


This view that the infant has an “insatiable” yearning for the breast 
can only be regarded as an adult reading-in of emotion into the young 
infant. It is difficult to see why the purely physiological sucking capac- 
ity of the neonate can not be satisfied as well upon an artificial nipple 
as upon his mother’s breast, or why a child who has never sucked his 
mother’s breast need ever be weaned from it. Examination of the 
weaning trauma suggests that it may be regarded primarily as a product 
of a widespread (but not universal) social situation in which the child 
has come to associate not only nourishment but also warmth and affec- 
tion with the mother’s breast, and in which separation from the breast 
creates a fundamentally different social situation to which he must 
adjust.® 

When weaning disrupts a series of satisfactions which the child has 
been receiving with regularity from infancy, it would be surprising if 
he did not react emotionally to this deprivation. One might suppose 
that late weaning would be more traumatic than early weaning, since 
in the former case the child would be more firmly accustomed to the 
satisfactions of breast feeding and hence more loath to forego them. 
Against this supposition the psychoanalyst counters that the develop- 
ing child ego is better fortified to withstand the loss of the mother’s 
breast at a later age; and that early weaning creates a craving for the 
oral satisfaction which has been denied,® resulting in a pessimistic or 
sadistic character, whereas late weaning results in self-assurance and 
optimism (36, p. 489). But this is all ex cathedra doctrine to the ag- 


“In those children who receive their nourishment from a bottle given 
y do not learn to recognize the mother (nurse) 


hole loved object, the process [of weaning] is 


8 Cf. Bernfeld: 
them by so many different people that the 
and source of milk, bottle, as a coherent w! 


i d" (11, p. 298). 
i fuera: ere at at developmental levels that do not afford enough 


3 “One gets the impression th ; r 
Nr. the organism refuses to go further, demanding the withheld satisfactions" 
] 


(Fenichel, 36, p. 65). 
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nostic,” in the absence of scientific documentation. A en i he 
esis is ‘that weaning, whether it comes early or late, gent y or s E eb : 

t be said either to weaken or strengthen the child's sense o "S 
wm il the entire situation in which the act takes place and the 
ed Be: y ince in the parent-child relationship have first been 
v Aa ow some circumstances, it is conceivable that severe 
meee may help to produce a strong, secure ego "a on 
other circumstances, the same weaning might contribute to the 
ment of an insecure, anxiety-ridden personality. 

Thumb sucking. As thumb sucking by inf. 
quently been used as à diagnostic of oral depr 
anthropological investigations, it is desir 
of this criterion. . l | 

Levy (87, 89) has experimentally confirmed the young animal's 
drive to exercise its sucking function in various ways when this had not 
been adequately exercised in the normal process of nursing, and he has 
demonstrated that thumb-sucking in the human infant is oflen a response 
to sucking deprivation (86, 87, p. 203). Roberts (130) has supported 
the latter finding. Surveying these data, Sears concluded that 

On the whole, evidence supports Freud's view... that thumbsucking is a 
reaction to deprivation. when the nipple js with- 
drawn from an infant's m 1t condition, as a habitual 


activity, is inadequacy of opportunity to suck extensively in connection with 
the eating process (136, p. 4). 


as, under 
develop- 


ants and children has fre- 
ivation in psychiatric and 
able to appraise the validity 


There is absolutely no question that thumb sucking may be “a re 
tion to [oral] deprivation.” 


deprivation need not result i 
thumb sucking does occur, it 


ac- 
But is important to point out that (1) oral 
n habitual thumb sucking! and 


(2) when 
is not always attributable to or. 


al depriva- 
10 Some of the more devout have also expressed the opinion that the oral character is 
not clearly defined. See Fenichel (36, p. 488). 
" Cf. Bernfeld: 


sudden, frustration , . 
for the unfolding 


- would offer the possibility . . . 
of an energetic ego" (11, p. 299), Thel 


ogic behind this argument is that 
iginally merged with the child’s 
of the child's ego. Bernfeld 


ass as lower-class 


K correlated with the lesser 
dle-class child was afforded for sucking while nursing. 
13 Levy notes that “the s issi ild i 


: ield a finger sucking pleas- 
ure than an aggressive one 


; regardless of the quantity of ora] tension" (87, p. 206). 
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tion. As the latter point has not received the attention it deserves, some 
documentation for it may be offered. 

Thumb sucking has been observed at normal delivery (76, p. 84), 
and Bernfeld reports that it is present in all infants during the first three 
months. Thumb sucking has been noted for a minority of children in 
primitive societies where unlimited nursing on self-demand is the rule.!5 
An interesting case in our own society concerns twins raised on a self- 


demand schedule: 

Each infant was allowed to nurse at any time as long as desired, and nursing 
periods ranged from three minutes to fifty-five. Sherry ... never sucked her 
thumb while Peter sucked his almost constantly at four months and in moments 


of stress at thirteen months (141, p. 176). 


In a Chicago study, Davis and Havighurst found that 


Negro middle-class children are treated much more permissively than the 
white middle-class children with respect to feeding and weaning... . Yet the 
proportion of Negro middle-class children reported sucking their thumb is 
almost the same as the proportion of white middle-class children so reported 


(18, p. 707). 


Gesell and Ilg (52, p. 124), and Bakwin (6) have voiced their conviction 
that oral deprivation is a factor "of minor importance" in the general 
etiology of thumb sucking. The latter writes, "We have frequently seen 
babies nursed at the breast and seemingly satisfied in every way, who 
sucked their thumbs; and we have seen these babies develop later as 
well-adjusted children" (6, p. 100). 

An adequate statement of the relation between thumb sucking and 
oral deprivation, thus, appears, to be: (1) thumb sucking is often the 


... all doubt concerning the regularly-established fact that all chil- 


4 He writes: "' 
is absolutely excluded during the first 


dren suck their fingers even without being hungry 


three months...” (11, p. 73). 


15 Mead's unqualified statement that “no primitive child whom I have ever seen or 


heard of sucks its thu mb or fingers," which appears inanauthoritative handbook of child 
psychology (104, p. 676) and which was repeated at the May 29, 1948 meeting of the 
Society for Applied Anthropology in Philadelphia, can only be deplored as an inaccuracy. 
Dennis has noted that of 288 Hopi children 2-9 years old, 13 or 6% were definite cases of 
thumb sucking (22, p- 181); Ward Goodenough informs me that cases of thumb sucking 
are to be observed among children on the island of Truk, despite unlimited breast feeding 

r data could no doubt be obtained for many other 


throughout the first year; and simila: r i 
tribes if sufficiently accurate observations were available. The more moderate thesis that 
finger sucking is comparatively rare among many peoples with adequate breast feeding 


of children remains, of course, in line with the evidence. Thus Kluckhohn notes that 
among the Navaho “Thumb-sucking is very rare, and we have never observed ‘obses- 
sive’ thumb-sucking or à child asleep with its thumb in its mouth. When excited or 
disturbed, a child will not infrequently put the index or some other finger in its mouth 


(82, p. 54). 
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i sucking deprivation (the child desiring to gain a 
one mg bean denied; (2) it may also result from adn 
gratifications (the child desiring to perpetuate or reconstitute ap €— 
which has been gratified); (3) it may be associated with neit s . 
cient nor excessive, but with entirely normal sucking experience. That 
thumb sucking may also result from factors extraneous to oral gratifica- 
tion is not to be excluded. It is clear, in any case, that no single-factor 
explanation of its etiology will suffice, and that it cannot, therefore, be 
used as an unqualified index of oral deprivation. 


“ Mothering" 


Ina series of widely quoted publications, Ribble has w 
about the importance of adequate "mothering" to the development of 
sound personality and organic health. By "mothering" she means the 
tactile contact, cuddling, and stimulation of the infant "by one indivi- 
dual" (127, p. 459). Such mothering, she contends, "primes the breath- 
ing mechanisms," improves “the tone of the gastro-intestinal tract," 
"fosters functional integration," relieves "muscular tension,” strength- 
ens the infant's “sense of equilibrium," and gives him “a sense of secu- 


axed rhapsodic 


the infant is said to have a 
Which helps organize his respiration, 


and of the 
appears,” 
etc. (125, 126, 128, 129), 


It is unfortunate th 


1 of the neonate is developed at birth is, 
of course, well known; and, in the sense that most mechanisms are made 
sed, it 


may perhaps be said that a "drive" to exercise its sucking 


16 Cf. Bernfeld: “Compairé emphasized stron; 


for the child’s development; indeed, he become 
^a 993 


gly the significance o 


f the mother's voice 
NEC a 
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function exists at birth independent of the infant’s need for nutriment 
Ribble asserts that this drive must be encouraged and satisfied if the 
infant is to survive and prosper, stating that “Fifty per cent of the 60( 
babies in our . . . study were definitely not ‘self-starters’ in sucking’ 
(129, p. 628). Other authorities, however, have stated that ‘Sucking 
is present in all infants after 24 hours" (137, p. 64); and there can be 
little doubt that spontaneous thumbsucking, which is all but universa 
among infants, can serve to satisfy whatever physiological “need” may 
exist for exercise of the sucking function, where this is not satisfied ir 


the normal process of nursing." (See the previous section on thumt 


sucking.) 

It seems to us, therefore, that Ribble takes too hysterical a view ol 
the neonate's organic and psychic resources, which are considerably 
greater than she allows and which enable it to thrive under a much wider 
range of care than the modern American pediatrician advocates to the 
American middle-class mother.!* The same point can be made in regard 
to the supposed need for “mothering” during the first few months of 
life. The human infant is obviously incapable of surviving without adult 
care and protection and, if one wishes, one may thus speak of his organic 
“need” for such care. But the minimum needs thus defined are probably 
much less than the pediatrician is apt to prescribe. 

The pediatrician's prescriptions are, in general, designed to recreate 
the sheltered position which the fetus occupies in the womb.!? We do not 


17 We are, however, by no means convinced that the activity of sucking is physio- 
lly necessary for infant health, but only that it is the normal complement to it: 
i.e., we believe that infants restrained from thumb sucking and fed intravenously could 
be potentially as healthy as normally suckled infants, and would probably even lose their 
sucking urge, just as it has been found that “if chickens are fed from a dropper for a long 
period of time after birth, they lose the pecking urge" (Levy, 92, p. 659). A study by 
Sears on cup feeding of infants seems to support this view: “In general, the results indi. 
cate that breast feeding produces an oral drive, or at least strengthens it, and cup-fec 
children do not have as much oral activity during the first ten days of life as breast fec 
children.” (Personal communication. A report of this study is in press in Pediatrics. 

18 There are, of course, pediatricians of a broader outlook who are aware of the fac 
that infants in other times and places have managed to grow to healthy maturity withou 
having enjoyed the care and advice of pediatricians. Gesell and Ilg are among these 
“Fortunately the factors of developmental safety are 80 great that most children weathe 
the everyday crudity of our methods of management (52, p. Ere - à 

19 "The manifold prescriptions which science gives the mot! ier for the care o th 

-born can be summarized in one phrase: warmth and protection from stimuli. Thi 
new € t our care of the infant attempts to reconstruct and strengthen the psychc 
ve condition in which we found the foetus" (Bernfeld, 11, p. 3). “... natur 
perpe n ahead of science, and with swaddling clothes, and the cradle, which we 
ae pea he amniotic fluid, created an artificial substitute for th 


to imitate the bobbing about int 
uterus which had been abandoned” (Alexander, 4, pP- 123-124). 


logica 
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i imply that this treatment of the infant is not good for it. But 
sim ai he observed, there are many cultures in which infants are 
a quer to fomifieimud which, in almost every particular, 
etur che "etoile" prescriptions of the pediatrician, and in which 
a grow up into normal, healthy cim i l " 

i ils i a composite description, we gain a co = 
Pe eorr arra the i Seon d of the foetus situation. 


i i i ater; it is placed on a flat pad with- 
ild . . . is bathed or washed in cold water; it is p 
on or is swaddled or left to lie motionless (11, pp. 6-7). 


Indeed, Bernfeld feels obliged to coin the term 
describe the type of infant care prevalent i 
cultures where ‘no attention is paid to i i 
and ... the babies are caked with dirt” (37, p. 83), where the infant s 
head or feet are deformed, or where its nose, ears, or mouth, are ie 
to piercing and its body to scarification or other painful mutilation. 
Until contrary evidence is forthcoming, there is no reason to believe that 
infants reared in a “foetusphobic” manner are any more neurotic or 
insecure, or differ in any significant aspect of personality from the 


"foetusphile" infants because of any single type of treatment t 
are subjecled.^ That foetusphobic i 


a high proportion of infants is quite 


"foetusphobic" to 
nso many societies. There are 
the filth which accumulates 


another matter; it may also ensure 


nts survive to maturity, whereas 


account of such treatment to 
al Flathead adult, e.g., 
; Viewed in profile, was only on 
Presses the belief that such 
earliest infancy, while the 


had a head 
e and a half 
deformation did not cause 
bones are soft and easily depressed” 


to two inches wide. Mason ex 
much pain "as it is done in 
(103, p. 173). 

?! Greenacre has declared that "heightening of the sa 
character . . . would seem to be the inevitable result” of 


- Such Statements are p 


t into the nature of 
the underlying sadistic 
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foetusphile practices encourage the preservation of infants with weaker 


constitutions.” 
Our argument has been directed against an exaggerated insistence 


upon the need for tactile stimulation of the infant by the mother for 
the maintenance of its mental and bodily health. We do not mean to 
suggest that some contact and, more important, affection from the 
mother or her surrogate is not necessary if the infant is to grow into a 
“healthy” individual (i.e., one conforming to an ideal-typical norm held 
by Western psychiatrists). Spitz has shown that when nursery children 
six to eight months old have their mothers removed, some of the children 
develop depressive symptoms, which he attributes to their loss of love 
object (138). Ribble describes negativistic or depressive states arising 
in children deprived of their mothers which, apparently, have a similar 
etiology.? The high death rate in foundling homes, despite satisfactory 
hygienic conditions, has been cited as evidence of the child's need for 
attention and love if it is to survive; however, it is unclear whether love 
or individualized care is the crucial desideratum here (any uniform 
institutional regime which is not adjusted to the different constitutions 
of its members is apt to lose those members whose constitutions can not 
adjust to it). 

A crucial experiment conducted by Dennis (19, 20) throws some light 
on the question of how much and what kind of attention the infant 
requires if its physical and emotional development is to proceed un- 
impaired. He raised twin girls from birth in conditions of extreme 
isolation. The infants were separated from each other by a screen and 
received only the purely minimum attention required for feeding and 
bodily care, being kept supine on their backs and not spoken to or 
fondled or played with or exercised in any way until they were seven 
months old. Because of their restricted practice, visually directed reach- 
ing and grasping, sitting alone and standing with help were retarded, 
but otherwise, the children “yielded during most of the first year a rec- 
ord of development not distinguishable from comparison records of 
infants in normal environments” (20, p. 157). The children responded 
affectionately to the experimenter during his brief visits, so that Dennis 
remarks: “Fondling is not necessary for the development of interest in, 
and every sign of affection for, the adult" (20, p. 156). His over-all 


conclusions agree with the ones we have drawn: 


2 Bernfeld has suggested that the foetusphile ates of Lune te 
origin "primarily to the economic-political interest of the nations in large popu ations 


(11, pp. 7-8). 
23 Ribble (127, p- 461; 129, p. 633). Cf. Levy (88). 
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normal behavioral development can occur in some infants when most of 
the first year is spent under conditions of minimum social stimulation and of 
very restricted practice. In other words, a large number of acts on the part of 
the adult which have been held by some people to be of importance may be 
dispensed with (20, p. 156). 


Similar conclusions can be drawn from the investigation of two 
Viennese psychologists who compared the development of Albanian 
children raised under conditions of great physical restraint, with that of 
Viennese children (17). Until they are one year old, the Albanian 
children are bound securely to a wooden cradle customarily placed in 
the darkest corner of the room, often with a cloth thrown over their 
heads so that no light is visible. These children displayed poor muscular 
coordination, but once given an opportunity to practice, their perform- 
ance improved rapidly so that it was clear no permanent retardation 


An important consideration to be ke 


€n accustomed since 
ntion of their mothers. One wonders if the 


ennis' experimental 


easure that change, Elsewhere, she ildren 
who have been bound to the cradle “will differ i Mp Nis ee 


Occasions when they 
P ie i : R S , ance disturbance to which the 
individual is subject, etc. (105, p. 673). Th measurable indices, but they 
have not yet been demonstrated, į 
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care by a single adult is requisite for the development of “normal 
personality. Ribble, it will be recalled, makes this assertion but th 
anthropologist will suspect it may be predicated upon the monogamous 
nuclear family system of Western society. Need the child who grows uj 
in a society with a strong, extended family maintain only one love objec 
during its formative years? Such a child cared for by siblings anc 
aunts would hardly be expected to develop the abnormally withdraw: 
personality common to so many orphaned and unloved children in ou 
own society. In short, Ribble and those who share her beliefs are no 
so much making an absolute judgment on the type of care which i: 
necessary for sound personality formation as they are making a serie 
of recommendations which, implicitly, are based upon the nature of th 
child's social environment in the Western family and upon the natur: 
of the family's position in Western society. Scientific investigations o 
desirable patterns of child rearing might proceed more successfully i 
the investigators were more conscious of this fact. 


Sphincter Training 


The anal-erotic, anal-sadistic or compulsive character is the mos 
clearly-drawn picture in Freud's album of characterology. According t« 
this picture, the infant derives a great deal of erogenous pleasure fron 
its bowel movements, and when this pleasure is inhibited by sever: 
bowel training, 

The usual result is reaction formation . . . in the form of a drive to limit al 
his primitive, impulsive, pleasure-seeking activities. . . - In extreme cases . . 
one becomes parsimonious, stingy, meticulous, punctual, tied down with petty 
self-restraints. Everything that is free, uncontrolled, spontaneous is dangerou: 
..» (114, p. 747). 


So widespread is the anal character in Western society that Rohein 
has argued that the entire culture "is based on sublimations or reactior 
formations of anal trends" (132, p. 283). Elaborating the Freudiar 
theory, some psychiatrists have formulated a system which directly 
correlates adult character traits with specific forms of sphincter training 


in childhood: 

...it appears that the child who has been trained in continence of urine anc 
faeces slowly and without pressure or punishment will yield control over hi 
eliminations without anxiety or resentment, learning to release without con 


26 Studies of personality development in orphanages and public institutions whicl 
Freud and Burlingham (42) and Goldfarm (55), among others, have conducted, thros 
some light on this question, but the abnormal nature of this environment in our societ; 
requires that generalizations from their findings be made with caution, 
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flict; but a precocious or harshly coercive training that forces the child, before 

he siolaically ready, to release to the outside demands, will set up resistance, 
3 comes retention as a defensive response and focus the child s behavior 
ran acquisitive or compensatory outlets for the denial of possession of his own 
eliminations (39, p. 22). 


Inline with this theory, Gorer (58, 59) has attributed many features of 
Japanese culture and character to the rigid bowel training of Japanese 
infants. 

However, when one comes to look for empirical verification of the 
influence of sphincter training upon personality development, evidence 
which would compare the personality traits of two groups of children 
whose home environment was similar except for the time and strin- 
gency with which sphincter training was imposed, the data are too 
meagre either to substantiate or to disprove the Freudian thesis. It 
is unfortunate that so little attention has been paid to this particular 
problem, since it appears to be the area of infant care in which experi- 


mental disposition of psychoanalytic precepts could most readily be 
made. 


Huschka (71) studied the reaction of 213 problem children at a New 
York hospital to the imposition of bowel traini 
"coercive" training which was started before 8 months or completed 
before 18 months, whereas bowel training initiated after 8 months and 
completed after 18 months was considered “adequate.” She found that 
children who were "coercively" trained manifested “undesirable” 
responses such as constipation, fear, and rage 
children who were “adequately” trained: 


ng. She considered as 


problem children. In a similar study, 


the reaction of 215 problem children to the institution of bladder training 
(72). 


Hamilton compared a gr 
or other anal-erotic traits in childhood with a 


's theory of the anal 

origin of these traits (63, pp. 467-472). 
These fragmentary and inadequate studies ons 
personality are all that we have b 
literature. Ardent believers in psych 
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“Despert found that early training produced an over-organized anc 
compulsive personality" (98, p. 411). Reference to the original paper by 
Despert (24), however, reveals nothing to confirm this judgment, which 
can only be interpreted as a wishful reading-in of conclusions unwar- 
ranted by explicit data. The same writer, reviewing a series of studies by 


Fries, states that 

... the role of habit-training was studied thoroughly. That it influenced be- 
havior was proved beyond doubt. ... It can be assumed that early training is 
very likely to have ill effects on emotional development. ... Untoward results 
are to be expected. What these results are in specific behavior has not yet been 
tabulated and set in conclusive terms. ... Nevertheless, it is conceivable that 
eventually more specific cause-effect data may be forthcoming (98, p. 410). 


Fries herself, however, subsequently drew a contrary inference from her 
work, stating that "any attempt to correlate age at which habit training 
was started with later personality traits, without consideration of the 
mother's personality type, would involve a large factor of error (49, p. 
92). 
In the light of the emotional argument in many psychiatric circles 
about the contribution of early sphincter training to the development 
of neurosis, a few words of caution as to the validity of this thesis may 
well be offered. Only fifteen years ago, an authority (149) writing in the 
standard handbook of child psychology advised that bowel training be 
started as early as six weeks, while another child psychologist stated 
that bladder control might profitably be initiated at 3 months (135, 
p. 74). The shift in opinion that has taken place in the past decade must 
be regarded as due to a middle-class fashion of the times rather than to 
any conclusive scientific validation of the preferability of later sphincter 
training. Against the swings of custom, of course, rational argument is 
futile. 

A common reason which is advanced in support of the desirability 
of late bowel training is that myelinization of the pyramidal tract is not 
completed until the child is at least a year old, and therefore earlier 
habit training makes demands upon the child which its organism is 
not yet mature enough to fulfill voluntarily; i.e., the child “has to rely 
on the local conditioning of the rectal sphincter by reflex action rather 
than on voluntary control" (71, p. 305). The relevant neurological facts 


are as yet, however, inadequately known: 

... much is still to be learned about even the neurological aspect of the opti- 
mum age at which to begin bladder training . . . it is not yet possible to make 
any definite statement about the time at which myelinization becomes com- 
plete.... [F. R. Ford] concludes that the sequence in which the function of 
various structures develops undoubtedly corresponds to the sequence in which 
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they become myelinated, but . . . some degree of function may appear in the 
Non fm of myelin. Also... Langworthy has suggested that myelinization may 
be the result of development of function rather than the cause (72, p. 256). 


In opposition to the orthodox Freudian's emphasis upon strict bowel 
training as the causal agent of anal character formation, 
advanced a view we are inclined to accept which explains 
traits as arising from a total social situation: 


Horney has 
anal character 


Sphere, one would under- 
nces in the early environ- 
pressed . . . a person does 
incter, but both are tight 
o hold on to what he has 


Kardiner has pointed out that in a society which demands early bowel 


training of its childeren, "cleanliness and orderliness are forms of acqui- 
escence to cultural demands, and are therefore forms of obedience" 
(79, pp. 43-44). Obedience, punctuality, cleanliness, ho 
other “anal” character traits ar 


e certainly inculcated into the child by 
middle-class parents of capitali 


St society, this syndrome of traits be 


character can be understood as an adjustment of the individual on the 
level of personality to concrete economic 
contrary contention advanced by Roheim 
to severe anal training, i 
hoard money, 


27 See Weber (144). Kr 
sketch of the distribution of 


1400's, "washed little and looked on bathing as rarely needed 1 i 
ous to health. ... This phase passed a i E M as likely to be danger 


e bourgeoisie, . ++ It was th ies in which 
the new wealthy middle class became specially į Setead eae 
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tance of social conditions in the channelling of libidinous forces, Fenich 
(35) has argued persuasively against Roheim's position on just th 
point. But even Fenichel holds that “anal eroticism produces the desi: 
to collect something," though adding that ‘‘What is collected is dete 
mined by reality" (p. 89). To speak of a general instinctual "'desire t 
collect something” as existing divorced from the material demands of 
specific cultural situation is, however, to invoke gratuitously a factc 
for which independent evidence can in no case be given. It is true, to b 
sure, that certain animals (e.g., rodents) hoard, but as the human anima 
does not exist separate from his culture, it is fruitless (untestable) t 
speculate about what might be his drives in a culture-less circumstance 
The more economical theory would appear to be to explain ‘‘anal’ 
personality traits in terms of the cultural universe which gives rise tc 
them. 
Restraint of Motion 

The problem of the child's emotional reaction to physical restraint 
has received a good deal of attention in the anthropological and psychia 
tric literature. Dennis (22), in particular, has advanced what appears te 
us to be an excellent solution of the problem, and his paper can be con 
sulted by those who wish a fuller documentation of our position. Green: 
acre (61) and Levy (92) have also contributed summaries of the litera. 
ture. 

In the course of a report published in 1917 which described variou: 
experiments they had performed on newborn infants, Watson anc 
Morgan declared: "If the face or head is held crying results, quickl; 
followed by screaming. ... Almost any child from birth can be throwr 
into a rage if its arms are held tightly to its sides" (143, pp. 166-167) 
This statement was repeated by Watson in many publications whicl 
appeared during the following eleven years. It gained credence fron 
an interpretation of the Freudian frustration-aggression thesis advancec 
by some scientists as a universal principle of human psychology: tha: 
aggression must always result from frustration.” Neither Watson's 
original finding nor the Freudian thesis, however, has (we believe) beer 
validated by empirical investigations. I : 

A number of psychologists tested Watson's observation in experi 
ments on newborn infants. Taylor found that in response to restraint o 
the arms no reaction occurred frequently enough to be called typical 


28 A list of these publications, 


. 202-203). . : 
oF 2»... . the existence of frustration always lgads-to-some form of aggression" (2: 
A ER 


p. 1). f^ p g 


appearing from 1917-28, is furnished by Dennis (24 
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and only 32 per cent of the infants tested made any response describen 
by Watson as a reaction to restraint (140, pp. 79-80). Pratt, 2 c» 
and Sun found that in 58 per cent of the cases tested, infants remainet 

ive in response to restraint of their arms: “no definite ‘defense’ or 
mic s tions were found in any considerable number of cases” (123, 
x Dai. elena studied the reactions of 150 newborn babies to the 
ine of mittens on their hands. Twenty-three babies got rid of the mit- 
tens; but most ‘‘showed no reaction unless the gloves were applied just 
before meal time.''?? . 

Prolonged restraint of infants is common and often very severe in 
cultures where it is the custom to swaddle the newborn child or tie it to 
a cradleboard during the first year of life. Among the Comanche, for 
instance, it is stated that "Children of six months when taken oft the 
cradleboards had arms as stiff as though they had been pinioned (95, 
p. 73). The Chinook child's head is bound to a cradle immediately after 
birth and the binding is not removed for a period of anywhere from 
three months to a year (103, p. 176). "Throughout the first year, Al- 
banian children are released from their cradles only once a day for a 
brief period of cleaning (17). The custom of infant restraint is very 
widespread: 


In both Americas the majority of abori 


ginal children were confined in some 
sort of cradle from their birth until they w 


ere able to walk about (103, p. 161). 


. . swaddling clothes were employed by the Jews, the Greeks and the 
Romans ...and were used throughout the medieval 


period in all European 
countries. Swaddling did not begin to disappear in England until 1750 (22, 
p. 209). 


Reports vary on the initial reacti 
restraint in the form of cradlin 
kicks, or otherwise protests ;32 
evokes no complaint. But its 


on of infants to the imposition of 
8 or swaddling. Often the child cries, 
at other times the swaddling process 
eems to be the unanimous testimony of 

*?? We have been unable to consult 
13, 496-502) and so this account is taken from Levy (92, p. 660). 

3l According to Pettitt, the cradleboard was used in every cultural area in North 
America north of Mexico, except for the Eskimo and a few tri 
United States; and, with few exceptions, by every tribe withi 

* Linton reports that Comanche children “always kick 
being wrapped” (94, p. 472); the Albanian child at least occ. 
into the cradle (17, p. 229). 

“Winifred de Kok... describes the swaddlin 
happy proceeding . . . ” (61, p. 212). . 

"The unanimous testimony of Hopi mothers affirms that w 
the cradleboard on the first day of life the process does not cau 
p. 95). 


the original reference (Revue Fr. du Pediatrie, 1937, 


n these areas (119, p. 12). 
ed and cried when they were 
asionally cried When strapped 


g of Italian children as an entirely 


hen the infant is tied to 
se the baby to cry" (21, 
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observers that once the child has been placed in restraint, it accepts 
this restraint without complaint during the first few months of life. (It 
is difficult to imagine what else the child could do but submit, when it is 
securely bound and crying is not rewarded by any loosening of its 
bonds.) In fact, when removed from their cradles during the first six 
months, Navaho and Hopi children often cry to be put back in them.*t 

As the child grows older and as it experiences increasing periods of 
freedom from restraint in the course of cleansing, play or exercise, it 
often will protest vociferously against the application of restraint. On 
the basis of these data, Dennis has come to the following conclusions on 
the child's reactions to restraint, with which we are in agreement: 

[The young infant]... has no unique pattern of reaction to restraint. He 
reacts to all strong and persistent stimuli with restlessness and crying. ... On 
the other hand, methods which achieve restraint of movement without steep 
gradients of pressure . . . do not produce restlessness and crying. ... Whether 


Or not restraint of movement will cause negative reactions in the older infant 
depends upon whether or not the restraint interferes with customary sequences 


Which have been built up. What is thwarting to the infant is not predetermined 
but depends upon post-natal events (22, pp. 211-212, 216). [Our italics.] 

In short, children who are accustomed to restraint do not experience 
it as a frustration; but once accustomed to freedom, the same pine 
may become a frustrating experience. If they have never esr 
restraint its imposition would, of course, generally be expecte o ES 
voke a hostile reaction. These are substantially the same "m i i 
to which Levy was led;” and he also adds the important cons! eratio. 
ill often cry to be put back into their cradles, and many 

re else” (85, P- 26). . 
= aiima to the cradleboard is kept ofi the 
to sleep” (21, p- 96). 


al he cries and does not go ( 
" the [Navaho] infant apparently begins to feel the 


4 “Young [Navaho] babies w. 
of them do not sleep satisfactorily anyw 
“When the Hopi infant who has become 
board for a longer period than is usu 
% “After about the sixth month . - 
í ill wai d" (85, p. 20). 
confinement a frustration and will wail to be release 2 ] i 
“Some Pueblo Indian infants at several months of à bim A aie Ma ae 
it isi he Indian intan a a 
on the cradleboard. But it is important to note that t 

off the cradleboard each day for an interval sufficient to accustom him to manual play 
and to other activities” (22, P. 213). Tm 

li i i nis reports tha 
In this connection, Denr pi e P oath 2 


in freedom later tried to strap her to t ard 2 H 
Struggled so strenuously that the mother finally relinquished the attempt, a general 


: i hild away from the reservation, where 
experie who have given birth toac à å 3 
E ee = wae and who have returned to their native village when the infant 
; 
was " (21, p. 98). . eT ' 

o -— ie drin creeping or walking, a restriction of Hate ac- 
tivities will be fel straint. The modification of activities before they have been ful- 
filled seem bi i paccm ' . Once experience of free activity has taken place, the 

e Si pper 
Situation is quite different” (92, P- 659). 


Hopi mother who raised her daughter 
t five months, but the child 
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that constitutional differences in the need of movement will influence 
the child's reaction to restraint.2® These considerations must be known to 
psychiatrists who argue that restraint of motion is inevitably frustrating, 
for some of the same psychiatrists use restraint to effect the sedation of 
their over-excited mental patients. There appears, then, to be no uni- 
form emotional reaction on the part of the child to a uniform type of 
restraint, but rather various reactions which differ primarily in accord- 
ance with individual constitution and previous experience. 


Infant "frustration" and "aggression" 

It appears from the discussion of restraint (and, we can surmise, 
what is true of infant reactions to restraint may also to some extent be 
true of responses to other disciplines as well) that an experience which 
psychologists often designate as "frustrating" may in reality not be 
frustrating at all to the infant. Two outstanding factors enter into the 
explanation of those cases in which infant restraint was not experienced 
as frustrating: (1) the previous experience of the infant conditions it to 
accept restraint as normal, and (2) restraint is only part of a total situa- 
tion in which the child finds itself and which contains many positive 
rewards (counterbalancing, apparently, whatever negative features may 
also be present). Thus, for example, the restrained Navaho child is 
protected "from the painful outcome of motor activity, such as picking 
up a hot coal or getting in the way of adults" and, at the same time, may 
actually participate in the social life of the family at an earlier age than 
does the uncradled American child? Contrariwise, there is no society, 


Cf. Hartmann, Kris and Loewenstein: "early swaddling . . . does not stimulate ag- 
gressive response, since it does not interrupt an activity but prevents one” (65, p. 25). 
These authors, however, also repeat the unjustified assertion that “By and large, the 
child tends to react to restraint by some manifestation of aggression" (ibid.). 

38 ! , , , the need of movement varies according to . . . innate differences, maturation, 
health, type of restraint, and emotional factors" (92, p. 662). ' 

Fries also lends support to the importance of constitutional factors in determining the 
child's reaction to restraint. "Children differ in their Congenital Activity Type. The 
difference in responses of the quiet, moderately active and active infant have been de- 
scribed in full by the author .... Infants of these three types approach similar situations 
in a different manner . . . a restriction of movement that is frustrating to the active child 
may not be so to the quiet child, whereas situations which are real obstacles to the quiet 
type are easily overcome by the active type" (49, p. 86). 

a9 |, . the cradle permits babies who could not otherwise sit up unaided to assume 
for long periods a position other than that of lying down, out of touch with what is going 
on around them” (85, p. 25). It might be added that the notion of an Indian cradle as 4 
comíortless prison deserves to be corrected. Nearly all cradles used by North American 


Indians contained dangling toys and objects to interest the child, foot rests suitable for 
exercising the legs, etc. (103, p. 162). 


—- 


p 
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even those in which unlimited breast feeding and the most lenient 
sphincter training are customary, which does not regularly inflict some 
pain and frustration upon its young. Even childhood experiences which 
the psychologist and anthropologist view as ideally "satisfying" and 
“unfrustrating” actually contain frustrating elements.‘ As Goldfrank 
has pointed out, “Pressures of some sort are exerted during infancy 
even in those societies which exhibit the greatest leniency in nursing or 
toilet training, and some degree of permissiveness is present even when 
infant disciplines are manifestly severe” (56, p- 517). 

We can, then, safely assume that infant experience is never uniformly 
“frustrating” or "satisfying" but always contains a greater or lesser 
admixture of both elements. The big scientific problem is how to estimate 
by objective means the extent to which any single situation, as well as 
the whole of childhood experience, may be considered weighted in the 

n the direction of satisfaction. Until this 


direction of frustration or i 
problem has been solved, the efforts of psychoanalysts and anthro- 


pologists to explain adu y in terms of infantile frustrations 
and gratifications can no ded as scientific, since subjective 


judgment rather than objec the weighting of 


infantile experience." : : 
Dollard and his associates attempt to define frustration objectively 


as "that condition which exists when a goal-response suffers inter- 
ference” and, again, “an interference with the occurrence of an instigated 
goal-response at its proper time in the behavior sequence 1$ called a 


It personalit 


t be regar 
tive procedure determines 


40 E,g., the Hopi e been cited as illustrative of societies 
which afford their children i ing early experiences. The Hopi child, how- 
: n i i its carrier's back and it is a regular 
4 A htly the first tim laced on its carrier s j J gula 
ves, drm ee i lips and then to withdraw this 


A ther to pla bits of food to the child's 1 
practice for the me P hi d is also often kept in semi-darkness, à cloth being placed 
aho, Kluckhohn reports that 


food (56, pp. 521-5247: 

over the face-guar radle (23, P 109). Of the Nav n 

“Teasing of children of nine months or à year and over 15 not infrequent—even by the 
R ildly sadistic forms. » ++ There are also delays in the 


mo v times 3 : 

ace be I even o ren. A child of seven or eight months may cry 
E ick H " 

fifteen or twenty minutes during t ore a mother picks him up (82, p. 62). 

On Okinawa, we are told, erized by placing burning leaves 

against the skin. Most Okinawan e fiery applications" (107 


p. 394). 


awans hav 


f pre-verbal child 


) ; . onted, at Yale University, with th 
ing and Irvin Child have experimented, à 2 MESH is ue 

of d m by observers of the “custom strength” of various childhood dis 
in epen ad i fubit strength" to which they give rise. The effort to objectify thei 
plines, and of the tuting a cultural bias for | 


i the risk of substi 
Si is commendable but it runs s : 
Laine y dns all American raters OF all American psychologists may agree i 


weighting a articular discipline 25 ugeyere” OF “Jenient” in terms of our values; Chine: 
i a A RUP S 

or Hopi Saati however, might weight the same discipiines differently. 
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frustration" (25, pp. 11, 7). According to this system, a discipline would 
not be frustrating if it did not interrupt any prior-established goal- 
directed activity, and by asserting that this is the case in regard to in- 
fant restraint, the evidence presented in the preceding section can be 
brought into line with the frustration-aggression theory. The theory 
accounts admirably for many experimental observations of infant 
behavior—e.g., that strength of crying following feeding frustration 
correlates with the strength of hunger instigation (p. 28). With “frus- 
tration" defined, however, the problem is now pushed back to the deter- 
mination of the strength of instigation and of what constitutes goal- 
directed activity. Both "instigation" and "goal-response" are defined 
abstractly,? but in practice their determination remains ambiguous 
in certain crucial infantile situations with which we are concerned ;? 
and circular (if no aggression appears, no goal- directed activity has 
been interfered with) rather than independent reasoning tends to re- 
solve the difficulty for proponents of the theory (as it tends to resolve 
difficulties for the proponents of most social science theories). 

When agreement can be reached that a given experience is frustrating 
to the child, we are still not justified in assuming that an aggressive 
response is inevitably called forth by the experience :*4 

In Freud's theory frustration as such is supposed to arouse hostility, Actu- 


ally, however, healthy persons—children as well as adults—are well able to 
endure a considerable amount of frustration without any reaction of hostility 
(69, p. 67). 

There are any number of frustrations that do not evoke aggressive response 
in the sense of discharging hostility against a social object or its surrogates. 
There are, for example, a number of experiments in which animals are frus- 


42 "An instigator is some antecedent condition of which the 
consequence . .. . strength [of instigation] is measured by 
gated response competes successfully with simultaneous] 
sponses” (25, pp. 3-5). 

"An act which terminates a predicted Sequence will be called 
goal-response may be defined as that reacti 


predicted response is the 
the degree to which the insti- 
y instigated incompatible re- 


behavior sequence" (p. 6). 
? E.g., we are told that even "birth may be an ev 
viously characteristic of the child in its intra-uterine 


; one passage in their work suggests 


i i “|. . frustration can conceivably 
occur during the birth process itself and at any time thereafter, This need not be the 


case, however, with aggressive responses as here defined. The first reactions to frustration 


may indeed be of a random character and may lack that destructiveness which is here 
posited for aggression” (25, p. 11). 


— 
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eer and in which such aggression does not occur. The sucking frustr: 

de ; s 1 T a- 

; = in infancy cause finger sucking or sucking of other objects rather than 
creased aggression. There is no proof that the so-called weaning traumas of 


Fari cause more aggression or even more phantasies of hostility... than 
n other children. The same may be said of all those frustrations that have to 


do with bowel and bladder control (93, p. 264). 

R The procedure of some recent analysts of "national character’ 
in dealing with this dilemma of what is frustrating to the child is par- 
ticularly objectionable from any scientific point of view. It consists of 
making judgments as to the nature of adult personality and then looking 
for infant disciplines which can be regarded as having the requisite 


frustrating or gratifying effect for the production of this personality; 
idgments about the nature of childhood dis- 


lence of personality traits which might 
on is then posited, but not proved. 
hich might disprove the 
hich will allow a second 
and achieve the same 
ss of science 


or, vice versa, of making jt 
ciplines and then looking for evic 
be fitted to them. A causal connecti 
At no point is any search made for evidence w 
thesis, nor are any objective rules laid down wl 
party to follow the same procedure independently 
results, (The analyst is thus safe in his lair, but the progre 


is scarcely advanced thereby.) 
One result is that the same childhood experience is arbitrarily read 
as having one significance for personality formation in one society and 
adleboard in the Southwest 


the opposite significance in another‘ —the cr: 
e.g., is said to have induced “passivity,” but among the Plains Indians, 


Or the same experience in the same society is inter- 
different significances for personality formation by 
Thus Erikson argues that the cradleboard of the 
d the earliest muscle expression may have left 


nchannelized energy to the teething period," 
and "sadism" of the adult 


the cradleboard for the early 


“aggressiveness.” 
preted as having two 
different observers. 

Sioux “because it limite 


a certain inheritance of u i 
and contributed to the “aggressiveness 


Siouan (30, p. 139); whereas “Belden credits 
development of stoicism in Siouan children, adding that the habit of 


leaving them bound in these cradles for hours on end, regardless of their 
crying, teaches them to bear life calmly’ (119, p. 12). 
How IMPORTANT Is INFANCY FOR THE 
STRUCTURING OF CHARACTER? 


we have discussed some of the known facts 
fic disciplines of child rearing and the 


30, 31, 32), and La Barre (84). 
his is perfectly possible; our criticism is di- 
les which guide interpretation. 


ons, 


In previous secti Y 
tween speci 


about the relation be 
8, 59), Erikson ( 
ds, to be sure, t 
nciate any ru 


4 Particularly Gorer (5 
46 On theoretical groun! 
rected against the failure to enu 
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development of personality. The present section surveys the over-all 
problem of the importance of the period of infancy in comparison with 
childhood for the shaping of personality. If it can be shown that major 
structuring of personality takes place after the first year or vear and a 
half, clearly the contribution of infant experience in general and of these 
disciplines in particular to subsequent personality formation has been 
exaggerated by some theorists. We believe that this is the case, and 
will call attention to some of the evidence that had led us to this con- 
clusion. The picture, however, is not entirely clear, and the honest 
investigator must confess that apparently discordant facts are faced 
at every stage of the inquiry. These facts must be reconciled, and not 
ignored, before any solution of the problem can receive general accept- 
ance. In this predicament, we may draw consolation from the remark 
of one student in this field that “If a psychologist of personality had to 
limit his discourse to theories that were securely proved he would have 
nothing to recount. In his realm there are no certainties" (116, p. xii). 

The biologic orientation of Freud's psychology and his concept of 


unconscious mental processes naturally led psychoanalysts back to the 
earliest period of infancy in their search for f. 


ality development." This movement was carried to its logical extension 
by Rank (124) who viewed the act of birth itself as the basic cause of 
future anxiety; since all infants experienced the “trauma” of passage 
through the vaginal canal, every child had some anxiety; especially pro- 
longed or difficult births might produce more severe anxiety states. 


Greenacre (60, 62) has recently revived this thesis, with some modifica- 
tions. j 


It is known that the unborn foetus r 
stimuli from outside the womb and tos 
but, of course, this furnishes no evide 
correlative emotional or psychological 


actors influencing person- 


esponds physically to certain 
ome behavior of the mother,*8 
nce as to the existence of any 


responses in the intrauterine 


41 The approach of biologists to the study of personality may be illustrated by Stock- 
ard: "The personality of an animal... depends primarily upon its nature or constitu- 


tion.... The biologist very well knows that the most important part of individual exist- 
ence for man is passed and over before birth" (138a, pp. 21, 26). The concept of the 
unconscious leads in the same direction, as Bernfeld has Observed:'',, , if one completely 
abandons . . . the equating of conscious and mental, 
trace the mental phenomena back to birth. Perhaps sometim 
trate beyond birth into the foetal being" (11, p. 212). 

15'* . , . following a hard day's work by the mother fre 
comes so active, kicking and changing its position, that it 
... The presence of fatigue toxin would seem, then, to produce a response in the baby 
through the mother's blood supply. ... The response of irritation and restlessness sug- 
gests the experience of discomfort for the foetus" (81, p. 185). 


e later we may... pene- 


quently the child in utero be- 
1s Impossible for her to sleep. 
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— Ly see the apparently contrary view expressed by Dunbar, 
; Ds .) Freud himself was critical of Rank's thesis, noting that 
has ever been carried out to determine 
whether difficult and protracted birth is correlated in indisputable fash- 
ion with the development of neurosis” (47, p. 96); he added, however, 
that the question was still open. It has been argued against Rank that 
"individuals who have been delivered by Caesarian section do not, as 
adults, differ temperamentally in any easily discernible way, from per- 
sons who have been normally born” (110, p. 87). However, this state- 
ment by Mowrer and Kluckhohn is, apparently, not based upon any 
empirical study.” The only such study of which we know was made by 
Pearson who compared the behavior of children whose birth took less 
than six hours with children whose birth took more than fourteen hours. 


His conclusion was: 
There seems to be a difference between 


time was brief and those whose birth-tii a 1 e 
somewhat more the picture of the “neurotic” child than the latter, which might 


indicate that a short birth was a more traumatic experience than a long one. 
But the duration of birth itself cannot be held responsible solely . ..* The 
child whose birth is short more frequently is the youngest and had very in- 
dulgent parents, while the child whose birth is long is the eldest, the oldest 
child of the sex or the only child (117, p. 288). 

en that in this case, the relation between duration of birth 
or neurosis was the reverse of that postulated by Rank. 
a available at present are clearly too meagre to allow an 
n of the question, and opinions must be formed in- 
of one’s general theoretical orientation and on in- 


ferences drawn from data that do not bear directly on the subject. 
The same can be said about the view of recent proponents ofa 
rooming-in plan for the hospital care of newborn babies that the com- 
mon hospital practice of separating infant from mother during the two 
weeks of the lying-in period “definitely interrupts - .. the initial sense 

these sensitive infants to anxiety” 


of security . - - (andl predisposes 
$ scientific meeting, two advocates of the new 


129, p. 631). Ata recent 
ses aie 4 the “tremendous” and "enormous ' adverse effect which 
separation from the mother during this period would have on the emo- 


u : e 
no trustworthy investigation 


the behavior of children whose birth- 
me was long. The former presents 


It can be se 
and anxiety 
But the dat 
empirical resolutio: 
stead on the basis 


onal and nervous reactions of infants delivered 
red to by Kenworthy (81, p- 181) who does not, 
rting data. s that “the Caesarian sectioned child is 
a d—he cries i kedly less irritated by the contacts of 
hild delivered through the birth canal." May one 


first-born € à 
Id is not less sensitized than the second-born child? 


19 Comparative the emoti! 


by Caesarian and n 
however, cite any SUP 
prone to be less sensitize 


handling, etc.—than the 1 ; 
then suppose that the Caesarian chi 
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tional development of the child.*? Such statements reflect only the sub- 
jective judgment of their authors, as is evident from the very form in 
which they are cast; there is as yet no evidence to support or disprove 
these assertions (and perhaps that is why the emotions are so frequently 
resorted to in propounding them). 

Analysis of the neonate state has been too much colored by a naive 
reading-in of adult emotions into the infant which is comparable to the 
anthropomorphizing views of insect behavior once current among an 
earlier generation of biologists. "When a child comes into this world, 
he comes in lonely and he is afraid,” one psychiatrist declares (9, p. 
582); another physician writes that the newborn is "a frightened, primi- 
tive little creature" (2, p. 593). Especially in psychoanalytic literature, 
it is often "stated quite confidently that the child at birth is capable of 
profound and intense feeling" (75, p. 752). We agree wholeheartedly 
with Hendrick's comment that these descriptions of the neonate repre- 
sent "far more projection of analytic theory and adult passions than 
scientific observation" (66, p. 33). 

To some extent, of course, inference is the only possible means of 
approximating to an understanding of the psychological condition of 
the newborn infant. But this inference should rest upon concrete ob- 
servation of neonate behavior rather than upon preconceptions as to 
the nature of the neonate state or extrapolations from a later period. 
The best observations we have to date indicate that the newborn infant 
knows neither anxiety nor confidence, fear nor happiness, but exists 
in an affectless and presumably consciousless state. Its behavior is 
characterized principally by a great amount of undifferentiated mass 
activity (74, p. 39). Its "resistance to many types of stimuli is very 
high; even during handling the infant will often fall asleep” (137, pp: 
74-75); most of its day is spent sleeping. “Wh 
or five days of age is dropped one or two feet 
perceptible response.... This is also true for 
stimuli, such as stroking or petting, 
show no overt reaction” (137, p, 145). 
response in the neonate have met wi 


en an infant below four 
it frequently shows no 


tions to the sound of a buzzer in 7 out of 8 newb i 
life. Pratt, however, has rejected her claim as invalid on technical 
and, after a thorough review of the literature, concludes that “T 


here is some evidence 
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" erp Saige — that “research workers, as well 
individual Saat te os eal a ea pama fee yc np] 
Jones and Burks are rabab cat 4 oe d a ud of life” (78, p. 4). 
tion of individualit ipe ^ dinem: e x € iin mi 
sensory thresholds fand wh Prin erm ese urge oe 

M , la at is interpreted as] emotional expressive- 
ness . . . ” (78, p. 5). Certainly there are important physical differences 
between infants at birth “in the maturity of the nervous system, in the 
way frustrations are overcome, in the response to the Moro test, and 
in the amount of activity” (48, p. 226). Fries divides newborn infants into 
three types, in accordance with the amount of activity displayed: “the 
quiet, the moderately active, and the very active" (48). Just how such 
physical differences correlate with later personality remains uncertain," 
although offering a promising field for investigation. Whether one desig- 
nates the physical characteristics of the neonate as its “personality” 
or whether one chooses to restrict the term to emotional and psycholo- 
gical phenomena which arise only in later months is purely a matter of 


definition. 

In the same ill-founded but influential article referred to previously, 
Watson and Morgan declared that the newborn infant evidenced three 
basic emotional reactions belonging to “the original and fundamental 
nature of man: fear, rage, and love” (143, p. 165). The “fear” response 
consisted of “a sudden catching of the breath, clutching randomly with 
the hands... blinking of the eyelids, puckering of the lips, then 
crying"; it was called forth by falling, loud sounds, ora sudden push 
(p. 166). The "rage" reaction, produced by restraint, has been discussed 


. 3 "a" “ ee 
in a previous section of this paper (supra, pp. 21-24). The “love” re 
“stroking, OF manipulation of some erogenous 


sponse was evoked by 
ned to experimental stimuli during the neonatal 


that certai ay be conditio 2 
period. The s ii sre difficult to establish, highly unstable, and cannot be set up in 
all infants" (p. 240). His view is echoed by Wenger. Conditioning in the neonate is 
unstable and not easily obtained . - + anticipatory mouth opening occurs at least by the 
Second month, but evidence of conditionability before that time 1s equivocal" (145, p. 85; 


cf. ; 

= don Se ae o to bea high correlation Liegen eps activity 
[in infancy] and ability to meet a new situation or to um oua c: o aadu grim 
or frustration” (48, p. 228). Support for her position peor e an RES urst's 
study of 200 Chicago children. They found that The c 3 ren who "us mos active 
when young, compared with those who were most quiet when young, ten o be: most 


; i t" (18, p. 706). 
SUB i ore aggressive, and less nea ] 
ive now, most punished, éd newborn infant Jacks personality. Although many of 


53 Allport, e.g., holds that : A : 
its Bears, un congenital, personality as such is not inherited. Only when the 
the environment, acting upon it and being acted upon by 


Original stream of activity meets P 122) 
it, do the first . . . incipient traits emerge" (5, P: ` 
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zone, tickling, shaking, gentle rocking, patting" (143, p. 167). From 
these three initial emotions, the entire gamut of adult emotional ex- 
pression was presumed to develop by conditioning and extension. It 
can be seen how Watson's theory complements the extreme psycho- 
analytic view of the neonate as an organism whose personality can be 
permanently structured by very early experience and conditioning, and 
how closely the writings of Ribble parallel this early statement. 

Just as his view of the infant's rage reaction to restraint has been 
rejected, however, so must the remainder of Watson's theory be dis- 
missed as unsatisfactory. It overlooked ''the possibility that some 
emotional reactions might not be elicitable at birth and yet appear 
later on as a result of maturation rather than as a product of condition- 
ing" (122, p. 235). Behavior which Watson claimed to have observed in 
response to specific stimuli was not confirmed by subsequent tests.» 

The most convincing attack upon Watson's theory was,made by 
Sherman (137, pp. 114-143) whose observations also illuminate the 
subject of adult reading-in of emotion into the child. He showed motion 
pictures of infant behavior to a group of graduate psychology students 
and asked them to name the emotion represented, the stimuli which 
produced the behavior having been deleted. No agreement was exhibted 
in naming the emotions. Similar conclusions were reached when judges 
were brought into direct contact with the infants (the stimuli which 
Watson used to evoke ''fear" and ‘‘anger’’ reactions were administered 
to the infant behind a screen, and the screen was then removed so that 
the students could judge the reaction). Sherman's conclusions deserve 
to be underscored: "In estimating the emotional behavior of infants, people 
tend to read into it their own attitudes toward the expected reaction. ... Fre- 
quently we ascribe to the child reactions of which he is not at all capable. 

xa Most persons judge the emotional behavior of an individual in terms of 
the stimuli which have produced the reactions" (pp. 122-123, 142).9 


54 See, e.g., Sherman's observation quoted above on p. 30; and the work of Dennis 
cited in the section on restraint. Repeating Watson's test for “fear,” Irwin dropped in- 
fants two feet; only two of 82 trials results in crying, no response was noted in 12 per cent 
and various limb movements were observed in 88 per cent of the cases (73, p. 168). 

55 One limitation should be noted in regard to Sherman's study. The behavior 
("emotions") which he asked judges to identify—"'hunger," "anger," “pain,” and "fear" 
—was all produced by disturbing stimuli. Had he attempted merely to distinguish 
general states such as "excited" and “quiescent,” the judges’ agreement would doubtless 
have been near-unanimous. 

A study by Goodenough (57) found that university students rated almost six times 
better than chance in judging emotions represented by photographs of a ten month old 


child, but at this age a greater ability to communicate (and detect) emotion from facial 
expressions would be expected. 
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As a result of 's work d : 
ata ol See tt n sae ttre ft 
$ : E o behavi 
2 € is its place they advocate a purely d enti = 
oui: of of organismic involvement in a response under definite 
eau [9 stimulation, age of organism, and so on" (122, p. 236). 
b sh ensus of opinion of competent students, summarized by Pratt 
end a it cpi review of the literature in this field, is that “It 
rena tain that such well coordinated reactions as those implied by the 
sen ove, rage, and fear, do not exist at this early age, [i.e. in the neonate]" 
an P 211, our italics). The present view regards emotions as de- 
ping gradually in the infant by a process of differentiation from 
generalized to more specific reactions (p. 208). A study by Bridges (12) 
of the emotional behavior of 62 infants in a Montreal hospital, for ex- 
ample, distinguishes only two amorphous states at one month — “ie 
Cited" and “quiescent.” At three months “distress” and “delight” are 
differentiated, and it is only subsequently that "anger," "fear," and 
affection” are manifested. One may reject this particular develop- 


mental sequence Or terminology: but the fact of differentiation from 
initially generalized responses and the importance of maturation" 


s the development of emotions seems to be generally conceded. ; 
The point we would light of the findings 


g child psychologists, e any justification 
Aa speaking of emotions ; of love p 
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certainly suggestive, evidence of having learned to wait four hours for feeding 
(100, p. 277). 


During subsequent months, a whole range of conditioned responses 
can be developed, which evidence greater stability as the infant ages; 
"retention of conditioned responses developed in infancy in some in- 
stances have been observed over periods ranging up to 7 months" 
(112, p. 376). 

Two interesting experiments performed upon rats by Hunt (70) and 
Wolf (148) lend considerable support to the view that infant experience 
is an important determinant of adult behavior. Hunt subjected one 
group of rats to food deprivation for a period of 15 days starting at the 
age of 24 days, and a second group to a similar period of deprivation 
starting at the age of 32 days. Both groups were then given unlimited 
feeding for five months, after which they were again subjected to a five 
day period of starvation, together with control animals who had not 
been deprived during infancy. The experimental rats in the 24-day group 
responded to the second period of deprivation by hoarding more than 
two and a half times as many pellets of food as did the control rats who 
had not been starved in infancy; but the 32-day group did not show any 
excess hoarding. Hunt interprets these results as showing that early 
infantile frustration has a greater effect upon adult behavior than does 
later infantile frustration (70, p. 359), a conclusion which would extend 
to the psychological sphere an observation that has been established in 
the biological (especially the embryological) sphere (sce Stockard, 
138a, pp. 45, 134 ff.). This, of course, is the firm logic behind the psy- 
choanalytic position. 

Wolf's excellent experiment (148) is of equal interest. He impaired 
the hearing of one group of rats by taping their ears and the sight of a 
second group by taping their eyes for several weeks during infancy. 
When the restraints were removed, the animals learned to see and hear 
normally as adults. But placed in a situation where they had to compete 
with control animals for food in response to a light stimulus, the animals 
blinded in infancy experienced difficulty in seeing and failed to get the 
food ; when a sound stimulus was substituted, the animals who had been 
deaf during infancy similarly failed. Animals whose functions were 
impaired for a period after infancy (and then restored), were not handi- 
capped in the exercise of these functions during the competitive situa- 
tion (pp. 33-34). 

Apart from the consideration of the applicability to a larger animal 


of findings made upon a smaller one, a question raised by these experi- 
$ joat 
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which reactions to a specific infantile frustration can be generalized and 
transferred to other situations. Murphy expresses the belief that 

à Studies on the influence of infantile frustrations upon subsequent drive and 
behavior structure... suggest a certain transfer and generalization to a genu- 
ine “personality configuration 1n the adult, whether animal or man... we 
may say that continuity is no longer merely the continuity of an item; it soon 


becomes the continuity of a system (114, pp. 723-124). 
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range of possible infantile experience. If one burns a die upon the in- 
fant's body, its stamp will endure throughout life. Perhaps there are 
intense and prolonged forms of experience during the first year which 
can similarly cast the infant's personality into a mold which cannot be 


reshaped by any subsequent events. .— 
In the normal range of infant experience, ^f. 
events subsequent to the first year or two of life have i" power to 
“confirm or deny" the personality of the growing infant, to perpetuate 
or remake it, depending upon whether the situation of later childhood 
perpetuates or alters the situation in which the infant was reared. Since 
this is one of our most crucial theses, and bears heavily on the problem 
of the relative importance of cultural as against instinctual factors in 
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personality formation, we will devote some space to its documentation. 

The picture of the infant obtained from many psychoanalytic ac- 
counts has much in common with the medieval notion of the infant as 
a homunculus or miniature adult. A scientific picture of the infant dur- 
ing the first year of life, however, does not support this notion, as All- 
port makes clear: 


[During the first year]... there are certain conditions that prevent the 
formation of dispositions as stable as those to be formed later. There is, for 
example, a low degree of retentivity for conscious experiences. . . . Then too, 
though the infant learns rapidly, he also forgets rapidly. Habits... are readily 
lost. ... The propensity for an all-or-none type of emotional activity prevents 
his learning discriminative affective responses or the development of a hier- 
archy of likes and dislikes. His capacity for conceptualization is slight. There 
are, then, plenty of reasons why personality should be less stable, less predicta- 
ble, and less consistent in the early months of life than at any later time. It is 
indeed never again as unorganized and unstructured as in infancy (5, p. 130). 


The following evidence suggests the vital importance of post-infant 
experience to the formation of personality: 


1. The personality of adult Pueblo Indians is marked by “maladjustment” 
and "apprehensiveness," despite an infancy characterized by unlimited in- 
dulgence (56). 

2. "Navaho Indian infants receive a maximum of protection and gratifica- 
tion, but the anxiety level among adults is very high—presumably because of 
post-infancy traumas” (110, p. 96). ‘Navaho children who leave the hogans 
calm and well-poised return at the end of the first school year nervous and 
tense... the theorists . . . claim too much for the earliest years and do not 
pay enough attention to later events and to the total situation in which the 
mature person finds himself" (85, pp. 68, 111). 

3. " . . . the Kaska personality norm is introvert and the Haitian extrovert, 
although in both societies babies are fed when they so desire, receive no pressure 


to control elimination, are comforted and handled when irritable o: 
suffer no discipline for emotional willfulness. 


teria than those associated with physiolo 
structuring, and moreover, that these 
fant experiences" (142, p. 575). ` 


4. St. Thomas children of the lower economic class are reared with great 
indulgence: “Breast feeding is ordinarily continued for ten or twelve months 
and not uncommonly for as long as two years. No regular feeding schedule is 
held to; the child is fed when itcries. .. . The training of sphincter control is not 
begun until relatively late, often not until the age of three or four years and 
very unsystematically then." Nevertheless, “there typically emerges an inse- 
cure, suspicious, egocentric adult” (14, pp. 43, 89). 

5. “The pathogenic effect of later impressions is firmly established, and it 
can be assumed without danger that a severe traumatic influence during the 
war, for example, might set up a traumatic neurosis in a person whose birth 
had been quite normal. Indeed, under a sufficiently Strong traumatic influence 
the healthiest of individuals can become neurotic” (4, p. 114). 
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6. Some American ethnic groups "were giving children more of the kind 

warmth and consistent mothering which psychiatry and pediatrics have = 
cently come to consider desirable than were the completely American Monde 
Se The unspoiled maternal attitude has much to counteract, however, vies 
it confronts the instability of depression years... a satisfying infancy does n a 
necessarily compensate for economic deprivation in the next ten years, aad 
there is increasing evidence that later gratification may go far toward offsetting 


the effects of early frustration” (115, pp. 656-657). 
7. In most societies, male and female infants are treated substantially alike 


for the first year or two of life, but differences in behavior are increasingly 
manifested in childhood and later years. Whatever behavioral and psychologi- 
cal differences exist between adults of different sex, then, must be attributed to 
biological factors and the differential cultural conditioning of post-infant years. 


Faced with evidence such as that cited above, some of which they 
have contributed themselves, Mowrer, Leighton, and Kluckhohn at- 
tempt to salvage a part of Freudian theory by reasoning that 

Infantile indulgence probably does constitute the firmest foundation upon 
which, if later circumstances are reasonably favorable, a secure and confident 
adult personality can be developed. But it affords only a possible basis; it does 
not, in and of itself, promise fulfillment . . - infant experiences, while placing 
certain constraints upon personality, give mainly potentialities. . . - Whether 


these potentialities become actualized or not... depends upon later social 
conditions . . . only an external condition of adult life brings out fully a pre- 


disposition, the basis for which has been laid in the experiences of infancy and 
early childhood (85, p. 11; 110, pp. 95-96) 
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in any case, too large to be fruitfully delimited on the basis of present 
knowledge. Instead of introducing teleological terms, it would seem 
preferable merely to describe the nature of organism and environment 
as precisely as possible at all stages in their history of interaction—in- 
fancy, childhood, adolescence, and maturity—allowing for the likeli- 
hood that (a) different infantile experiences acting upon different con- 
stitutions may produce the same personality type, (b) the same in- 
fantile experience acting upon different constitutions may produce 
different personality types, (c) the same childhood experience following 
different infantile experiences may produce similar personality types, 
(d) different childhood experiences following the same infantile ex- 
perience may produce different personality types. 

We conclude that the rigidity of character structuring during the 
first year or two of life has been exaggerated by many authorities, and 
that the events of childhood and later years are of great importance in 
reinforcing or changing the character structure tentatively formed dur- 
ing infancy? Or one may substitute Horney's formulation: '.. . the 
sum total of childhood experiences brings about a certain character 
structure, or rather starts its development. With some persons this 
development essentially stops at the age of five. With some it stops in 
adolescence, with others around thirty, with a few it goes on until old 
age" (69, p. 152). 

DISCUSSION 


It can be conceded that social scientists have failed to produce a 
definitive answer to the question of the relation between infant dis- 
ciplines and character development, because of a general lack of his- 
torical and cultural sophistication, the diffculty of establishing the 
validity of the personality measurements employed, and the difficulty 
of isolating single factors for study. It is hard to see how the last ob- 
stacle, in particular, can be overcome. Social phenomena can not 
readily be subjected to the type of crucial experiment which enables 


5 Beaglehole (8) has proposed a useful distinction between the “primary” character 
structure formed in infancy, and the “secondary” character structure which may be 
formed in later years: “The integration-determining structure of needs and feelings con- 
structed during infancy is generally reenforced and supported by the continued con- 
ditioning of the middle years of childhood and adolescence. ... Hence this original 
character structure may very well be called the primary character structure of the 
person. Under normal circumstances and in relatively static societies it will be the only 
character structure that the person possesses. In certain societies lioweyer the per- 
son may very well build or be forced to develop, either for Bébmanenk e temporary 
change, a secondary character structure which will enable him as a person to react 
dynamically and adaptively to a new set of major directives” (p. 149). 
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oo that personality is not the resultant of instinctual infantile 
hu a drives mechanically channelled by parental disciplines, but 
ather that it is a dynamic product of the interaction of a unique or- 
TER undergoing maturation and a unique physical and social en- 

ronment. This view is in substantial agreement with the position 
which Horney (69, P- 70) and Fromm (51, p- 286), among others, have 
advocated. Aside from meeting the facts of personality formation better 
than does orthodox Freudian theory, it has the advantage of being able 
to relate character structure to à concrete historical situation, which 
Freudian theory has been unable to do: 
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this is true of attitudes, must it not be equally true of infant experience, 
that no experience can be meaningfully isolated from the biological- 
psycho 'ogical-cultural context in which it is embedded; that no dis- 
cipline has an automatic, invariant emotional effect upon the infant? 
There is nothing new in this idea. Fromm (51, p. 293), Horney (68, 
p. 80), Kardiner (80, p. 224), Fries (50, p. 14), Lois B. Murphy (115, 
p. 654), Mowrer and Kluckhohn (110, p. 96) have all said the same thing 
at various times. 

A particularly clear statement of this view was made by Pearson 
in 1931, after an intensive study of 72 problem children at the Phila- 
delphia Child Guidance Clinic. The question which Pearson posed was 
whether the child's relationship to the parents, or the physiological 
events of its life were more important in shaping its personality. Under 
"relationships to parents," the attitudes of each parent toward the 
pregnancy, toward the child, and the position of the child in the family 
were determined; under "events," the health of the mother during 
pregnancy, the duration of pregnancy, length of birth, duration of nurs- 
ing, duration of weaning, method of weaning, and method of bladder 
and bowel training were recorded.” His conclusion was that: 

... the parental attitudes exert a more important influence on the formation 
of the child's personality than the actual events. ... 

There is no question that the physical situations of a child's life bear a 
definite relation to the formation of his personality, but the number of associ- 
ated etiological factors renders it impossible to consider any single situation as 
causative, This study does indicate, however, the marked importance of pa- 
rental attitudes. ... The child forms most of his ideas about the world of 
humanity from his impressions of his parents, and their attitude to him... 
must affect the formation of his personality to a far greater degree than the 
length of his birth or the duration of his breast feeding. The events of a child's 
life enhance and crystallize the parental attitude toward him... and the child 
reacts to the parental attitudes through its association with the events of his 
life rather than to the events themselves (117, pp. 287, 290-291). 


If this hypothesis is correct, the importance which Freud ascribed 
to traumatic events of childhood in structuring personality was greatly 
exaggerated. Several investigators have reached the same conclusion. 


680 Cf. Davis and Havighurst (18) who, interviewing 200 well-sampled Chicago fami- 
lies, recorded similar data on child training and personality with the object of testing for 
any correlation between the two. Their conclusion was negative: 
relationships appeared" (p. 709). 

& E.g., Sears: "Several sources of evidence indicate . . . . that Freud seriously over- 
estimated ... the importance of childhood sex aggressions . . . childhood [sexual] ex- 
periences are outlawed in our own culture and the child who has them, either willingly or 
unwillingly . . . is made to feel guilty or ashamed . . . the guilt and anxiety may in some 
instance lead to actual neurotic symptom formation. This would naturally be attributed 
to the experienced event rather than to the emotions involved" (136, pp. 36-37). 
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morphic view is the most difficult error to guard against in studies 
of infant psychology. 

Many writers speculate about a process of “empathy” or ‘emotional 
contagion” whereby the young infant senses the mother's attitude to 
it by means of subtle physical expressions, body tensions, or some less 
material signs. At times the speculation hovers on the borders of 
mysticism; we have seen little evidence that such empathetic transfer 
takes place in the very first months of life. The supposition is intri- 
guing, however, and if it can be reduced to behavioral terms and con- 
firmed, it would provide a vital missing link in the mother-child chain 
of personality communication. There is a good deal of evidence that 
subtle behavioral cues to maternal emotion are detected by the child 
in later months of life, and that these cues may be more important in 
governing its character development than are the gross patterns of dis- 
cipline which an observer may quickly note. 


SUMMARY 


This paper reviews some of the empirical data bearing on the theory 
that various features of infant care determine adult personality. Our 
conclusion has been largely negative, and we have been led to substi- 
tute a theory which emphasizes, instead, the importance of constitu- 
tional factors and of the total cultural situation in personality forma- 
tion; the importance of post-infantile experience is also indicated. To 
the criticism that this paper stops where it should start, with the defini- 
tion of specific situations which can be calculated to produce specific 


psychological effects upon individuals with certain constitutions, we 
can only agree. 


5*5 See Dunbar (28, p. 157), Escalona (33, p. 80), Greenacre (61, p. 209), Lundberg and 
Farnham (97, p. 305), Maslow and Szilagyi-Kessler (102, p. 84), Mullahy (111, p. 178)» 
Plant (120, p. 96), and Sullivan (139, p. 8). 

9 Some of the most convincing evidence concerns the reported transfer of food 
aversions from nurse to infant. Escalona (33) notes that infants' acceptance or rejection 
of orange or tomato juice depended upon which juice the person feeding them liked or 
disliked, as well as upon the feeder's attitude to the infant. Similarly, Dunbar declares 
that “there is probably no point in prescribing for the infant any food substance for which 
the mother, nurse, or whoever is in charge of feeding, has a definite dislike” (28, p. 157). 

67 Dunbar, however, suggests that the psychological state of the mother can even be 
communicated to the unborn foetus: “The physiology of the mother is changed when she 
is under emotional strain and the effect of these changes is transmitted to the foetus 


through the placental circulation and in other ways" (28, p. 156). Cf. Greenacre (60). 
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A NOTE ON GRANT'S DISCUSSION OF THE LATIN 
SQUARE PRINCIPLE IN THE DESIGN OF 
EXPERIMENTS 


B. R. BUGELSKI 
University of Buffalo 


'The suggestions made by Grant! that researchers might find a vari- 
ety of uses for the Latin and Greek square arrangements of variables 
are especially to the point for psychologists concerned with learning 
problems in which a series of materials is learned in succession by the 
same subjects. Although Grant points out the similarity of principle 
between the Latin square and the traditional ABBA arrangements used 
commonly to control practice, fatigue, and other cumulative variables, 
there are some features of special interest in rote learning experimenta- 
tion that were not brought out in Grant's general treatment. 

The learning of a single list of nonsense syllables. The problem of 
which items in a single list are learned first, last, etc., is usually attacked 
by use of a list containing syllables of allegedly equal association value. 
Although individually the syllables may meet some criterion of low asso- 
ciation value it may turn out that a given syllable is more or less easily 
learned depending on which syllable it follows. The writer has accumu- 
lated evidence to show that a given syllable has different learnability 
depending not only on its position in a list but also on what syllable it 
follows. Any claim made about positional effects based on the k of 


i i i hes 
either to use many different lists or to use t 
orders and arrangements SO that each syllable follows every other 


: M h 
syllable equally often, and in each and every ordinal pode. md 
six-syllable list could be expanded into a latin square m e d 
lists in which each column represents one arrangement S owing 


in which the separate syllables appear, as follows: 


12345 6 
$3 46 1 3 9 
362 5 1 4 
415262535 
Sg 3 0.2 2 
65432 1 


in each posi- 
it i arent that e pears in 
ri "s t always precedes and follows a different syl- 
ie yay any combination of syllables 


. In this w j y 
d he learning of either or both is uniformly 


ple in the design and analysis of psyc 


In this square ach syllable ap 
tion only once and 
lable in each list (colum 
that would tend to facilitate t 


The latin square princi 
. Bull., 45, 427-442. " 
P 49. 


hological 
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distributed among the subjects. Each syllable also appears in each 
position thus controlling any advantage that might accrue to a syllable 
from position alone. . 

The learning of successive lists of syllables. In this type of problem 
a subject may be asked to learn one list of syllables or problems after 
another. The several lists should be thought of as comparable to in- 
dividual nonsense syllables in a single list and the lists should then 
be arranged as above to avoid the possible cumulative effects of some 
one problem or list acting more positively in some other problem or 
list, thereby giving a spurious effect which might be attributed to posi- 
tion or to some other variable. In a proactive inhibition experiment, 
for example, it might make a difference whether list 4 follows list 5 
or 6 or some other list, and if such other list is chosen at random, the 
results may be open to a cumulative effect error. 

Odd-numbered vs even-numbered squares in experiments with nonsense 
syllables. Some of Grant's illustrative examples are taken from the field 
of agricultural experimentation wherein a number of variables are simul- 
taneously counterbalanced by the Latin square technique. Under such 
circumstances it does not matter which item follows which in rows or 
columns since cumulative effects are not being considered. In such a 
circumstance, the Latin square shown by Grant 


A BBD A 
EAC B D 
BDEAC 
AB DCE 
DCAEHB 


is apparently appropriate since each item appears in each row and in 
each column. It should be noted, how 


not appropriate for nonsense syllables 
lists because some of the items follow particular items more than once; 
thus if A has a specific effect on D, such an effect will occur twice be- 
cause D follows A in column 1 and in column 5; again E follows C twice, 
B follows E twice, A follows B twice, etc. A little experimentation will 
demonstrate that it is impossible to arrange any odd-numbered set of 
items into a square without this difficulty, whereas there is no such 
problem with even-numbered lists. On page 439 in a foot-note discussing 
the Greco-Latin square, Grant states that a square of such a nature 
could not be developed with 6, 10, 14, and 18 unit sets of variables; 
this might be misconstrued to imply that experimenters should work 
with sets of lists or lists with odd numbers of items, whereas the con- 


trary is true with the Latin square if cumulative positional effects are 
to be avoided. 


ever, that such an arrangement is 
either in one list or in a series of 
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NOTE ON THE POSTMAN-STONE CONTROVERSY 


JACK BERNARD 
The Klein Institute for Aptitude Testing 


Those readers of the Psychological Bulletin who are interested in the 
aw of effect have been provided with 


present status of Thorndike's 1 
ar-long exchange of "notes" 


entertainment, if not information, by the ye 
' between Postman and Stone (4, 5, 6). Since this exchange 


has reached the point where it consists largely of emotionally-charged 
words and phrases, it is perhaps time for a hitherto uninvolved party 
to risk an attempt to reconcile their divergent viewpoints. 
Thorndike’s law of effect, in its 1932 reformulation, served to split 
al opinion concerning the role of punishment in learning and 
al studies. Postman's bibliography of 332 titles (3) 
] enthusiasm with which that problem was at- 
tacked. However a half-century of extensive and intensive experimen- 
tation has served only to increase the divergence of psychological 
opinion relative to the effect of reward and punishment on learning. 
Both proponents and opponents of the law of effect have accumulated 
stores of research ammunition to justify their positions. Experiments 
may be cited to back the position that punishment helps, hinders or has 
no effect on the elimination of errors (e.g. D and this is true of reward 
as well. There seems to be experimental evidence available as "prol 
of any pet thesis which any individual may care to state as a "law, 


and “replies’ 


psychologic 
to spur experiment 
attests to the vigor anc 


including the four logical possibilities suggested by Stone (5, p. 154). 
Thorndike's law of effect in particular has not stood the test of time 
and research in terms of achieving the universality implicit in its for- 
mulation. Nor, in the opinion of the writer, can 1t withstand the test 
of common sense and everyday experience. If each of the trio involved 
in the present discussion were to burn his fingers in a new and un- 
familiar type of fire, it is to be doubted if their later avoidance of that 
fire would have the indirectness of the creature who, according to 
Thorndike's “law,” is "Jed by the annoying cs d : do d 
thing else to the situation which makes him later uei d ds D A low 
the original connection" (7, P- 71). Human behavior, ead t A e pie 
of psychologists, tends to be somewhat more complex than that o the 
B cerea that any effort to generalize within the important 
51 
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area of reward and punishment is hopeless? Not necessarily. It de- 
pends, to a considerable extent, upon how wide a generalization is 
attempted. It appears probable that any attempt at a "Jaw" regarding 
“the effect” of reward and punishment on learning in general is fore- 
doomed to failure. Available evidence suggests that reward and punish- 
ment do not have a single effect upon all types of learning under all 
conditions, but rather that at least four “effects” may be involved 
depending upon the specific situation. Tolman, Hall and Bretnall (8) 
have presented evidence for “laws” of emphasis, motivation and dis- 
ruption, and evidence has also been produced which points to an 
informational value of such punishments as are applied in contiguity 
with errors (e.g., 2). 

It we recognize that any of the above effects may take preeminence 
in a specific learning situation, and that combinations of effects may 
operate simultaneously, we will have gone a long way toward reconciling 
divergent viewpoints and toward understanding the differences in re- 
sults obtained experimentally. We will have recognized that the 
effect of reward or punishment in a learning situation depends on the 
specific nature of the reward or punishment, the specific learning situa- 
tion involved, and, not least of all, the specific individual being re- 
warded or punished. Adherence to the law of parsimony, seeking for a 
single “truth” or "Jaw" applicable to all situations, is admirable. But 
persistent, obstinate adherence to a generalization which has failed of 
its intended universality is illogical and unscientific. 

Thorndike's generalization, popularized as the "law of effect," has 
a firm foundation in experimental research. However many other re- 
searches, just as competently planned, executed, and interpreted, yield 
results not in accordance with this generalization. To the writer, this 
suggests that a generalization as wide as the law of effect is not feasible. 
Perhaps several “laws of effect" are called for, each to describe results 
obtained within a narrower sphere. In the light of available data, 
neither battles of quotations nor battles involving the situation of se- 
lected confirmatory experiments serve a useful purpose. It may be 
more profitable to recognize the contradictory nature of the available 
evidence, recognize the possibility that at least to some extent 


“I am right, and you are right, 
And all is right as right can be," 


and seek generalizations which will fi? all the available data more ade- 
quately than does the law of effect. 


————————M 
a 


NOTE ON THE POSTMAN- 


STONE CONTROVERSY 53 


BIBLIOGRAPHY 


1. BERNARD, J. The specificity of the ef- 
fect of shock on the acquisition and 
retention of motor and verbal habits. 
J. exp. Psychol., 1942, 31, 69-78. 

2. GiLBeRT, R. W. The effect of non- 
informative shock upon maze learn- 
ing and retention with human sub- 
jects. J. exp Psychol., 1936, 19, 
456-466. 

3. Postman, L. The history and present 
status of the law of effect. Psychol. 
Bull., 1947, 44, 489-563. 

4, Postman, L- Discussion of Stone's 
note on the law of effect. Psychol. 


Bull., 1948, 45, 344-345. 


5, Stone, G. R. A note on Postman's 
review of the literature on the law 
of effect. Psychol. Bull. 1948, 45, 
151-160. 

6. Stone, G. R. Reply to Postman. 
Psychol. Bull., 1948, 45, 536-537. 

7. 'THORNDIKE, E. L. The psychology of 
wants, interests and attitudes. New 
York: Appleton-Century, 1935. 

8. TorMaN, E. C., Hatt, C. S. & BRET- 
NALL, E.P. A disproof of the law of 
effect and a substitution of the laws 
of emphasis, motivation and disrup- 
tion. J. exp. Psychol., 1932, 15, 601- 


614. 


BOOK REVIEWS 


Bonixc, E. G., LANGFELD, H. S., & WELD, H. P. (Eps.). Foundations of 
psychology. New York: Wiley, 1948. Pp. xv +632. 


The Foundations of Psychology is the successor of the editors' pre- 
vious elementary texts: Psychology: a factual textbook, 1935, and In- 
troduction to Psychology, 1939 (both reviewed in TuIs Journav by the 
writer: 1935, 32, 595-599; and 1939, 36, 784-786). Although the editorial 
board remains the same, fifteen of the present eighteen contributors are 
new. It is no longer true, as it was for the previous editions, that most 
of the authors are "descendants" in the Titchenerian tradition. The 
present group of authors shows a great diversity of theoretical and pro- 
fessional background, and their writing well justifies the editors' state- 
ment that "In 1948 the important thing about the organism is not that 
it is conscious, but that it reacts to stimulation." 

The sequence of chapters and their authors are as follows: the nature 
of psychology (E. G. Boring); the response mechanism (C. T. Morgan); 
response (C. T. Morgan); growth and development (L. Carmichael); 
feeling and emotion (W. A. Hunt); motivation (D. W. MacKinnon); 
learning (C. Hovland); retention and transfer of learning (C. Hovland); 
recollecting, imagining and thinking (T. A. Ryan); perception (E. B. 
Newman); sensation and psychological measurement (S. S. Stevens); 
color (F. L. Dimmick); visual space perception (F. L. Dimmick); hear- 
ing (E. B. Newman); taste and smell (C. Pfaffmann); somesthesis (F. A. 
Geldard); topographical orientation (D. G. Griffin); individual differ- 
ences (A. Anastasi); heredity and environment (A. Anastasi); efficiency 
(M. E. Bitterman and T. A. Ryan); personality (L. F. Shaffer); per- 
sonal adjustment (L. F. Shaffer); vocational selection (C. L. Shartle); 
attitudes and opinions (H. Cantril); and social relations of the indi- 
vidual (L. P. Crespi). 

This is an impressive list of highly competent authors assigned to 
write on the broad fundamentals of general and applied psychology. 
The sequence of topics conforms to widespread opinion, and the various 
presentations are in general clearly and authoritatively written. The 
chapter on topographical orientation is the only one whose presence is 
apt to be seriously questioned. The subject is interesting, as are many 
others not included in the book, and well organized; but to the reviewer 
the topic has no valid claim for inclusion as a major item in an intro- 
ductory text. Were it not for the fact that the chapter on "Response" 
is already probably overloaded with a miscellaneous series of topics, 
one would be inclined to say that topographical orientation belonged 
in part there and in part under "Hearing" and ‘‘Visual Space Percep- 


” 


ion. 
It fell to Boring’s lot to write the first chapter on “The Nature of 
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" the most difficult one to write, the least satisfying one in 
the earlier editions, and the one of most questionable value in any begin- 
ner's text. Except in this first chapter, there is almost no reference to 
consciousness in the volume, other than the casual use of the term in 
such expressions as "conscious pangs of hunger” and “the stream of 
consciousness." Boring, however, tells the reader emphatically that 
consciousness is a part of the subject matter of psychology, the other 
part being behavior. The individual authors nevertheless do not seem 
to justify such a statement by their treatment of the science. Certainly 
the student will be at a loss to find discussions of consciousness in the 
body of the text to compare or to coordinate with the accounts of be- 
havior. Boring's statement that “only a hundred years ago you would 
have been told that psychology is the study of consciousness" (p. 5) will 
also seriously confuse the student unless he notes (p. 9) that “As late 
as 1910... most psychologists ... believed that physiology studied 
the body, that psychology studied consciousness and that introspection 
was the direct way of getting at consciousness." The place of conscious- 
ness in the psychology of the first chapter is further confused, for the 
reviewer and perhaps for professional readers, by the statement that 
"A science of psychology cannot, of course, be built up on any one 
so most of consciousness for every psychologist is 
the experience of others." Presumably this experience of others is ac- 


cessible to scientific study. In the discussion illustrating the nature of 
includes the following statement: “One of the 
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experimentation, but it is not the only one nor is the interpretation of 
“concomitances as causes and effects" as wise and safe a procedure as 
the statement implies. To interpret correlation coefficients or learning 
curves as cause and effect relations between the variables concerned can 
result in some strange conclusions. The experimental methods of agree- 
ment and difference, to use Mill's terms, deserve more specific recogni- 
tion than they get under the author's term "control." Where one deals 
with the determination of the effect of benzedrine on intelligence scores 
or with the effect of the extirpation of one brain area upon behavior, it 
is confusing to try to fit the methodology to concomitant variation as 
though one must always deal with continuous functions. 

Although the authors of the various chapters present highly com- 
petent and sometimes brilliant expositions of their topics, it is always 
possible for a reviewer to indicate ways in which the accounts suffer or 
ways in which they could be improved from his point of view. In the 
spirit of this observation, the following points are selected for mention: 
(1) In spite of the amount of space allotted to the nervous system and 
the receptors, no indication is given of the afferent nerves involved and 
but little mention is made of the effects of nerve lesions. (2) Neither in 
the chapter on psychological measurement nor in the one on individual 
differences can the student learn the simple statistical formulae necessary 
for his work or how to use them. (3) The chapter on color might have 
given less space to color equations and the color pyramid and more space 
to such psychophysiological topics as dark adaptation, integration, 
intensity-time relationships, etc. The section on "physiology of color 
vision" is notably weak. (4) Stevens, in the chapter on sensation and 
psychological measurement, writes as follows: “The sense organs start 
the messages along the nerves, the highways to the brain. When these 
messages merge at the higher centers of the nervous system, when they 
organize themselves and modify one another through interactions and 
associations, we call the result perception. But the bare messages them- 
selves, isolated and apart from their mutual influences, we call sensa- 
tions. We get sensations by analysis, by paying attention to certain 
aspects of our perceptions. . .. By selective attention he [the artist] 
analyzes the organized scene into patches of color and he translates these 
sensations into pigments on canvas.” Now the reader has already 
(painfully) learned, in the chapter on the response mechanism, that the 
messages carried by the nerves are electrical changes. As Stevens states 
it, electrical changes are therefore the sensations, for example patches 
of color; and we get at these electrical changes by paying attention to 
aspects of perception which are the integrated electrical changes in the 
brain. The reviewer does not believe that Stevens means what he says; 
but he believes that the careful student will be confused by the account. 

In the editorial preface to the volume, it is pointed out that the 
present book is about twice as large as the previous edition. ‘‘The strong- 
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ey apid advance of psychology is the need of frequent 
revisions of textbooks. It is nine years since our last book, Iniroduction 
to Psychology, was published. In the meantime we have had a long war 
and à victory which psychological research helped attain. In this war- 
time research much new and valuable knowledge came into being. In 
addition there has been the more normal acquisition of facts as well as 
a clearly distinguishable change in point of view." Here we deal with 
matters of opinion. The reviewer finds no new point of view in the book 
and no chapters on topics which could not have been well discussed 
ten years ago. The reviewer does not believe that the advances made by 
psychology in the past nine years have been so extensive as to require à 
doubling of the size of elementary textbooks. Nor does he believe that 
wartime research produced more than a barely appreciable amount of 
new knowledge in PS} £ gh the applications of existing 
knowledge were varied and extensive. During the war, as before and 
al handicap of applied psychology was the 
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dents will come better to comprehend the temper of some of the famous 
psychologists and the atmosphere in which they wrote. u 

Rand's Classical Psychologists of 1912 had a similar use, but it is out 
of date. History gets out of date. In 1912 the philosophers were much 
more important and the physiologists much less important than in 1948. 
Besides, 1948's past can now contain a little of what was only 1912's 
future. . : 

Dennis does not, however, give enough material to make his book 
satisfactory as a sole text. It is conceivable that a thousand pages con- 
sisting of one hundred excerpts from eighty authors might do as a sole 
text, with the instructor’s lectures filling in the interstices and setting 
up the dynamic interrelations, but I am not sure. Direct quotation is 
so uneconomical. What had to be labored in 1838 can be quickly under- 
stood in 1948. Johannes Miiller’s elaboration of his ten laws of specific 
nerve energies was necessary then, while thought still resisted his 
dicta, but not now, when what he was saying has become a common- 
place. 

The simplest way to indicate the content of this book is to give a 
chronological table of the forty-nine authors and their sixty-one ex- 
cerpts. Here each mention of an author indicates a single excerpt, ex- 
cept when a parenthetical number indicates a joint authorship or more 
than one excerpt from the same author in the same period. In most 
cases readers will be able to guess the nature of the subject matter from 
the date and the author. 


Antiquity: Aristotle (2).:4 ssec eene eed 2 
1601-1650: Galileo, Descartes... ..........ccccceeececucweuee 2 
1651-1700: Hobbes, Mariotte, Newton, Locke................. 4 
1201-1750: Berkeley, Hartley. «2522s x angsee sss saa us 2 
1751-1800: Mesmer, Thos. Young, Dalton.................... 3 
USOT 1800} TROS. Young. » «-eennececakatommskezirstiieamees 1 
1811-1820: Chas. Bell, Thos. Brown.......................... 2 
1821-1830: Flourens, Jas. Mill...............Luuuuuuuuuuuuuun 2 
1831-1840: E. H. Weber, Joh. Miiller......................... 2 
1841-1850: J. S. Mill, Braid, E. H. Weber, Helmholtz.......... 4 
1851-1860: Helmholtz, Fechner.............................. 2 


1861-1870: Helmholtz, Galton 


arene s dori 2p Babee dae. Gi DD deti 2 
1871-1880: Wir Esas ae ess seems avs 2s ep ORG waa + « cocer 1 
1881-1890: Preyer, G. S. Hall, Galton (2), James, Ebbinghaus, 
H. H. Donaldson, Cattell (4).........0000000000........... 11 
1891-1900: Dewey, Titchener, Thorndike..................... 3 
1901-1910: Thorndike (13), Woodworth (3), Binet, Simon, Pavlov, 
TR. Angell.«cseenen rea EEL $8 SEV 65 uum a8 28 bE once 
1911-1920: Watson, Hunter, Cannon, Terman, Kohler (2), Franz 
(35, Dashley Gd) -a o o moeng eteti scuepusinn arrai Aetea 
1921-1930: Yerkes, Carmichael, Barbara Burks, Lashley, Hull... 5 
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. It is plain that Dennis has welcomed the philosophers witl 
discretion, admitted the important physicists, turned definitel ri EE 
the physiologists, and accepted mental testing, while he e cade 
abnormal and social psychology. Galton, Cattell as tester Binet i 
l'erman are here, but there are no signs of Charcot, Janet Freud a 
the contemporary psychopathologists, nor of Le Bon, Tarde, McDou 
gall and the modern American sociotropes. In this Dennis follows mare 
than he leads, for this history which he documents is that part of e 
chology's history which is best known and has been most fully presented 


in texts. 

The distribution against dates seems to me to be about right. Aris- 
totle should come in, but the rest can be omitted until the seventeenth 
century. Rand gives fifteen philosophers before Hobbes to Dennis’ one. 
It is jolly to have at hand that passage from Galileo on pitch and fre- 


quency. At the other extreme of time I pause to consider and assess 
my contemporaries. Dennis cites twelve living men, all except Kóhler 
born in America. Dewey at eighty-nine is the oldest, Carmichael at 
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Perhaps some of Physiol. Psychol., 1902, T, 339-350 would do, if nothing 
better turns up. (3) The only reason for quoting H. BL. Donaldson on 
the temperature spots is that he is already in English. Blix or Gold- 
scheider would be more appropriate, but I should quote von Frey who 
brought this skin theory together and defended it. If the articles in the 
1890's do not yield suitable sections, then his Vorlesungen über Physi- 
ologie (1904) will. (4) Cattell has three papers from Wundt's laboratory 
(1885-1887) cited, besides the famous one on mental tests. It.is not 
fair to the Leipsig laboratory to have Cattell its sole representative. 
Wundt’s remark that Cattell was "ganz Amerikanisch" might be ex- 
tended to include the functional orientation of Cattell's research. Nor 
does Cattell merit more quotation than any other author. I should 
keep one of his three, and add from Wundt's aegis two others which 
have special historical significance—perhaps L. Lange on reaction (be- 
ginning the experimental psychology of attitude, 1888) and Meumann 
on the time-sense (1892). (5) And then I should omit Burks on nature- 
nurture because I do not think her paper had great influence. 

Those five changes would reduce the sixty-one items to sixty. Now 
here are forty-five more that seem to me to be of an historical impor- 
tance as great as the items already included. Besides enlarging the book, 
they would also fill in lacunae and provide greater dynamic continuity 
for the genetic process being sampled. 


Let me begin with Germany and Austria. It would be interesting to have 
something from the astronomer Bessel on the personal equation (1823). Pur- 
kinje’s account of the phenomenon that bears his name (1825) is short, and the 
student will enjoy being shown that Purkinje observed ‘twilight vision’ at 
dawn. It would be well to have something from Herbart (1824) about the limen 
of consciousness and the inhibition of ideas. Then there would be Lotze and I 
should hope the Medicinische Psychologie (1852) would yield a section on local 
signs. Hering ought to come in somehow. Perhaps his nativism (1864) is more 
significant nowadays than his color theory (1874). Next I should hunt for a 
suitable account of brain localization in Fritsch and Hitzig (1870) and for 
proper comment by Brentano on the nature of act and intention (1874). Mach 
ought not to be left out for he leads straight into Gestalt psychology. His 
Analyse der Empfindungen (1886) would provide a section about the positivistic 
answer to dualism or on space sensations. There would hardly be room for 
both. The best thing to take from Stumpf’s Tonpsychologie would be the de- 

‘scription of tonal fusion (1890), but there is also his significant adoption of the 
psychology of function in 1907. We might have both of them. In the wordy, 
polemical G. E. Miiller it is hard to find brief quotable material, but what about 
his psychophysical axioms and his hypothesizing of cortical gray (both 1896 and 
related)? I have already suggested Wundt on elements and their combinations, 
but perhaps we should also have Wundt on reaction times, since mental chro- 
nometry characterized the Leipsig laboratory. It will not do to omit Kiilpe 
entirely, and his 1892 criticism of the subtractive procedure in reaction times 
(especially if Ludwig Lange is included) would do for the early Kiilpe, and his 
1912 paper on thought for the later Kiilpe. In the Wiirzburg School, we might 
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hope for Watt on fractionation (1904) and Ach on determining tendency (1905 
ge raum ee Gestalt psychology, we ought to have, I think, von Ehrenfels 2 
idum n (1890), something from Schumann on perception in his papers of 
ELI (the Gestalt psychologists do not accept Schumann in their ancestry 
u think he is significant), and certainly an excerpt from Wertheimer, ra 
from the 1921 paper in the Psychologische For- 


which I nominate a selection 
schung. That leaves out of my list Loeb on tropisms (1890, or better 1899) and 


Lipps on empathy (1903). 
With abnormal psychology ruled out, the French become less important. 


There are already six French articles included. Gall on phrenology (ca. 1825) 
and Broca on the speech center (1861) might be added. It does not seem right 
to include Binet and exclude Ribot, but so many of Ribot's books were on 
mental discase. Could his Psychologie de l'attention (1889) furnish a significant 
section? And then what about Piéron? Does not France provide one living 


psychologist to rank with America's twelve? 
My recommendations for Great Britain are these. Robert Whytt on the 
reflex (1751). Marshall Hall on the reflex (1832 or 1837). Thomas Reid on the 
powers of the mind (1785) as faculty psychology and leading into phrenology 
i association (1855), in addition to James Mill and 
i 1 evolutionary psychology (1870- 
James used Spencer's Principles as the 


1872) if the right passages C i ; 
y. Ward on the subject-object 


only text in his course in physiological psy \ | c 
relation, cognition from the 1886 Britannica article or the 1918 


text. Romanes (1883) to show how careful he was about the anecdotal method. 


Lloyd Morgan (1894) for his canon. (1904). C. S. Myers ought 
to appear, but I do n f anything truly significant for his role in British 
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can-cartatnly be included on overlappin I think of the 1916 paper in 


the Bri 7 PS chol., because it And then there is McDougall 

on instincts (1908) and on purposive behavior (1923). Do we put Bartlett in? 

O HW 

n reminiscence perhaps e Stanley Hall (1883) in Dennis' 
ists are Americans. 


Now the Americans 

list. F ighteen five psycholog 

i Psat onward eig Pes ice Americans and only five others. 
ica well enough. I do, however, 


ee 1900 there are thir Op d aft pet 

erl . 1 ead ooke after ee . 

Perhaps Dennis i Ea ades t not to be limited to his theory of 

emotion but to have $ i hat shows his position vis-à-vis functional psy- 

sholagy and Gestalt Ps ne from the Principles (1890). 

J. M. Baldwin should appea sychology bit I do not know 
7) Münsterberg should come in on applied psy- 


what to quote s i 
cl à dition to the paper on structural psychology, might be 
hology. Titchener, in addit eeling (1908) or of thought 


: 1 
quoted on introspection 9 04) is needed to offset the 


1 i nsci ~ 1 
Du. jen co sciousness a5 reaction (1915) ought to be in, 
pisticists. E. ?' tionism and modern be- 


m he gags Lea intelligence tests, put I should like 
e Yei b is hould somehow receive the ball 
a see him $ i f the series Reid 
fr tone should be there as the terminus © " 
om Holt. And Thursto? ike, Binet, Spearman, Terman, Thomson. 


none befor 


62 BOOK REVIEWS 


The net result of all this discussion is the suggestion to delete four 
of Dennis’ sixty-one papers and to add forty-eight new ones, making 
a new total of 105. 

Now let us see how this is related to countries and languages. We 
shall count contributions, not men. 


Germany 
and Great America France Greece Italy Russia Total 
. Britain 7 

Austria 
Dennis 12 16 23 6 2 1 1 61 
Dennis 
and 
Boring 34 29 28 10 2 1 1 105 


Of the twenty-nine from Great Britain, nineteen were prior to 1883, 
which is the date of the earliest American paper. The associationists, 
the physicists and the biologists of Great Britain contributed most 
heavily to the early preparation for modern psychology. 

The ratio 34/28 would not seem to me to weight the German papers 
too heavily in relation to the American. The ratio would be still larger 
if you stressed the nineteenth century more heavily in relation to the 
twentieth. Somewhere in the nineteenth century the Germans took 
over from the British, somewhere in the twentieth before Hitler the 
Americans passed the Germans. 

Thus we arrive at the problem of translations. To be used as a text 
in American universities, such a book must be printed entirely in Eng- 
lish. Dennis got along with the minimum of translation, for, of the 
twenty-two contributions in five foreign languages, nineteen were al- 
ready printed elsewhere in English translation. The two pages of Mari- 
otte on the blind spot and the eleven pages of Flourens on the nervous 


om the French for this book. Two 
the nerve impulse were translated 
How much new translating would be re- 


4 al of 105 contributions were to be put into 
effect? Twenty-four Papers, twenty in German and four in French, 


with the possibility that two German translations could be saved if 
you took the less satisfactory but already translated books of Herbart 
and Lotze. That is a very considerable undertaking. It might run to 
three hundred pages of new translation and it is no wonder that Dennis 
did not undertake it. 

Has not Dennis perhaps here made an answer to the contention of 
the Minnesota psychologists that the young new psychologists do not 
need to spend their time learning French and German because "in the 
few instances where a graduate student needs to consult a foreign lan- 
guage publication, it would be more economical of the student's time and 


quired if my proposal for a tot 
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ave the material translated for him" (Amer. 


Psychologist, 1947, 2, p- 136)? If it were so simple as all that, Dennis 
could use the facilities which graduate students use at Minnesota and 
include my twenty-five German and French papers in English trans- 
lation in his second edition. Certainly if we are to get those translations, 
Dennis must get the work done for us. When he has passed on, his suc- 
cessor may not know enough German to decide what to translate. You 
do not simply send the 53,735 German pages which Wundt wrote down 
to the university’s translating bureau for an ‘English’ of them, so that 
you can then decide which are important enough to use. : 

About this useful book of Dennis’ I have one minor bitterness, 
directed more at the published than at the editor. The right-hand page 
heading shows the topic being considered. The left repeats endlessly’ 
(266 times) the title of the book. No chance that the reader will forget 
what book he is reading, and also but little chance that he can find what 
he is looking for by leafing through to it. I tested mysel out p ne 
authors from the sixty-one Page headings and pa sna ase Pe eu 
of the Mind" : that ough Reid, ought it not?—or Duga ewart, 


is aid, of all persons. 
" "AIL in pum Me oe d Dennis earns our gratitude. 
in all the ag 


The courses in the history ill have an additional text, 
courses ! : y iis : 
and the browsing graduate student with scholarly inclinations Wl 
often be found reading it. Epwiw G. BORING. 
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available for representing data, but the material covered is presented 
in a useful way. In only one respect do the reviewers disagree with the 
treatment of curve fitting: the application of a special case of the method 
of least squares to all forms of data. It is obvious that the method of 
least squares is one which is based upon certain assumptions, and if the 
assumptions are not fulfilled by the data, then the results of using the 
method are uninterpretable. When Lewis fits curves other than straight 
lines to data, he achieves the fit by transforming the data to yield a 
straight line and fitting this transformed function by least squares. 
Such a solution involves many assumptions, some of which are men- 
tioned, none of which is emphasized. The problem of what deviations 
from best fit to minimize is always an important problem. If the data 
warrant elaborate treatment, the rationale of the steps used in fitting 
a curve to the data may be developed and may involve a complex 
analysis. It the data do not warrant such treatment, the fit may be 
chosen more obviously on a basis of convenience. In the latter case, 
some of the other methods mentioned by Lewis will yield easier and 
equally fruitful solutions. Considerations respecting least squares also 
apply to the discussion of the index of determination (p. 81). 

Chapters 5 and 6 present a short account of calculus. Differentiation 
is treated in Chapter 5 and integration in Chapter 6. The treatment of 
calculus is probably less extensive than may be desirable. The discus- 
sions of differentiation, integration, and partial differentiation are con- 
densed, and there is no consideration of methods of integration or of the 
use of integration tables. The problems of repeated integration and 
elementary differential equations are not mentioned. 

Most of the latter part of the book (Chapters 7, 8, 10) is concerned 
with problems of the normal curve and problems of goodness of fit. 
Chapter 7, on the normal curve, is good and Chapter 10, on the fitting 
of straight lines for testing the significance of parameters is also valuable. 

Some portions of the treatment of distribution functions (Chapter 
8) and goodness of fit (Chapter 10) will be much less obvious to students 
than the discussion of earlier topics, such as logarithms and differentia- 
tion, and the use of the book as a text may require considerable supple- 
mentary lecturing. There may be some question, for example, as to the 
value of presenting the formula for the chi-square distribution (p. 174) 
without showing how one arrives at it; Lewis uses the formula to arrive 
at the mean and modal values of such a distribution. Such a procedure 
may help to eliminate some fear of large formulae but, from the point 
of view of understanding the distribution, the student might just as 
well be given the mean and modal values arbitrarily. 

Chapter 9 is one of the most valuable chapters in the book. Tt gives 
the student an opportunity to apply the principles he has learned to a 
connected background of theory in his own field. The treatment of 
Hull's theory (p. 233 ff.) is good, and the description of Hecht's photo- 
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d hypothesis 1$ valuable, although presentation of a later versi 

An theory would be desirable. Finally the reviewers believe that 

row's equation . 247 ft. i e ex iv 

bmc ds us (pp ) receives more extensive treatment 

We. s tempted to compare Quantitative Methods with books in other 

bes ls written to serve comparable purposes. In physics, chemistry, and 

eeu d one encounters treatises on quantitative a be 

es range in difficulty from Daniels' Mathematical Preparation p^ 

P en ie Chemistry to Mellor's Higher Mathematics for Students of 
hemistry and Physics and the Sokolnikofis’ Higher Mathematics for 


aa and Physicists. Lewis’ book lies in difficulty somewhere be- 
ween the level represented by Daniels and levels represented by Mellor 
and the Sokolnikoffs. Despite the fact the Daniels book is written at 
it seems to the reviewers that 

f view of training 


a more elementary level than Lewis', 
the Daniels work is more satisfactory from the point o 
ticularly in applications of 


in basic mathematical techniques, par 


calculus. 
It may be argued that a course which is primarily of psychological 


interest should not treat topics such as calculus which can be learned 
in other courses. Nevertheless, it has been the experience of the re- 
viewers that a course in quantitative treatment which emphasizes the 


application of calculus is of great benefit in eluci 


problems in a student’s own field. 

A great deal of space in Quantitative Methods is taken up 

cal theory. This orientation is an important one and must not be under- 

estimated. Nevertheless, it presents certain difficulties, not the least 
f thinking which is more 
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book stimulates the mathematical training of psychologists, it will be 
a major contribution, and other books dealing with the same topic may 
be expected to follow. If these books improve on Lewis’, it is likely 
they will do so by emphasizing the rationale of analysis rather than the 
techniques. In any case, Quantitative Methods evidences an appreciation 
of an important gap in the training of psychologists and fills the gap 
with no little success. 
C. H. GRAHAM. 


C. G. MUELLER. 
Columbia University. 


Rucu, FLovp L. Psychology and life. (3rd Ed.) New York: Scott, 
Foresman, 1948. Pp. 740. 


Ruch's new edition of Psychology and Life is an enlarged, better writ- 
ten and illustrated version of his earlier volumes. 

In evaluating such a textbook one must of course acknowledge that 
the merit of a particular book depends upon the instructor’s interests, 
bias, and philosophy of the purpose of psychology. Relative merit also 
depends upon the level of ability of the student, the major course the 
student is pursuing, and the probability of his taking further psychology 
courses. 

If the instructor wishes to emphasize the physiological-experimental- 
scientific approach, Ruch's third edition probably falls short of Munn's 
Psychology. lf the instructor wishes to remain in the traditional frame 
of perception, sensation, learning, memory,thinking, but with perhaps a 
bow to the application of psychology to life, the Psychology of Wood- 
worth and Marquis is probably more suitable than Psychology and Life. 

However, for the instructor whose students will have little further 
work in psychology and who expect some immediate answers to every- 
day problems, Ruch's revision has no competitor. Whereas, his former 
editions have been overly talkative in parts, have admittedly stressed 
subjects "interesting" to the student, have been weak in systematic 
presentation, the present edition has none of these weaknesses. State- 
ments are well documented. The writing is direct, easier to understand, 
better organized. In the trend of picturization so well done by Munn, 
Ruch has added a variety of excellent drawings, some in color, and a 
number of well chosen photographs. 

Considerable material has been added by reducing the size of print 
and tightening the exposition. Several chapters are now the result of col- 
laboration with others: two chapters on learning (with Robert Leeper): 
"Emotional Development: Clinical Findings" (with Joseph Shoben); 
and “Illustrated Reference Manual: The Brain and Nervous System’ 
(T. C. Ruch). Several editorial consultants are acknowledged. 

A section on emotional adjustment includes some 135 pages, and is à 
distinct contribution for the student who is looking to psychology tO 
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^n and others’ behavior. Techniques of 


give him some insight into his ow 
psychotherapy such as non-directive counseling, finger-painting, play- 
in an interesting and basic 


therapy, and psychodrama are presented 
manner. A new chapter on ‘Problems in Human Relations" which dis- 


cusses marriage, parenthood and delinquency is characteristic of the 
movement into the applied area. 

As to adverse criticisms, the revision will appear to many to but em- 
phasize an already undesirable “popularizing” of psychology. It can be 
said for example, that it is unsound to introduce the topic of marriage in 
a beginning course in psychology, that marriage is the province of sociol- 
ogy, not psychology- Others who may accept marriage as a field for 
psychology will affirm that nothing good can come from a seven-page 
treatment. Many will be querulous about including such paragraphs as 
"What About Dunninger?" and “Is there Extra Sensory Perception?” 
(Students ask these questions even if the textbook writer does not.) 
Again, as in previous editions, the good rats do not run the students 
through mazes, although more attention is given to them than previ- 
ously. This will no doubt continue to be construed by a few as an un- 


desirable movement toward understanding people. . . 
i popularize his writing—but most of 


In fine, Ruch has continued to 

the previous criticism concerning looseness, lack of fact, and lack of a 
basic approach, have been met. Ruch has played to his audience, as any 
good teacher must—and why not? 
The third edition of Psychology an 
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been well selected. What is equally important, I think there will be 
wide agreement that the book is unusually well written. The author's 
style is lucid and smooth. v o 

The unique feature of his book is that the traditional subdivisions 
of educational psychology have been almost entirely ignored. One will 
look in vain for, say, a discussion of the law of effect or of Gestalt psy- 
chology. For this reason, the book nay not appeal to some who organize 
and teach their courses in educational psychology in the traditional 
pattern. It seems to me, however, that good reasons can be adduced for 
reorganizing beginning courses in educational psychology to make them 
more functional, more interesting, and more nearly up-to-date. Dr. 
Davis has provided not only an interesting and well written text but a 
stimulus to further experimentation in presenting the subject matter 
of educational psychology. 

FREDERICK B. Davis. 
George Peabody College for Teachers. 


GnuisELLI, E. E., AND Brown, C. W. Personnel and industrial psychology. 
New York: McGraw-Hill, 1948. Pp. xiii 4-475. 


As a survey of the major applications of psychological techniques to 
personnel problems in business and industry, this book is designed to 
meet the needs of university students, as well as those of persons in 
business and industry who desire to obtain a foundation in the basic 
principles of industrial psychology. The book represents a culmination 
of four earlier mimeographed editions, which were used in classes at the 
University of California over a period of several years. 

The sixteen chapters are arranged in a logical sequence. Following 
the introductory chapter, in which emphasis is placed upon the scien- 
tific method as applied to human (psychological) as contrasted with 
mechanical (physical) factors in production, the topics of job and worker 
analyses, criteria of job proficiency, and rating methods are successively 
treated. An understanding of these techniques is basic to all of the 
topics subsequently presented. The next chapter deals with the inter- 
view and personal-data analysis, and is followed by three chapters 
dealing with various aspects of the validation and use of psychological 
tests. Subsequent chapters treat the definition and meaning of effective 
work (including efficiency, fatigue, etc.), conditions of work and pro- 
ductivity (diurnal changes, length of work period, rest pauses, shift 
work, physical conditions of the work environment), effective methods 
of work and the design of tools and equipment (time and motion study; 
work simplification, layout, and what has recently come to be called 
"bio-mechanics"), industrial training (two chapters), accidents and 
safety, monotony, and worker motivation and morale. The volume 
contains 115 tables and 38 figures (all line drawings), selected references 
at the end of each chapter, and terminal indexes by author and subject: 
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ook is identical to that of other volumes in 
ler the Consulting Editorship of Dashiell. 
. this book provides an excellent presenta- 
tion of the basic facts and methods of industrial psychology, and repre- 
sents the best textbook currently available in this field. The presenta- 
tion is sound and lucid, and there is repeated emphasis upon the need 
for empirical validation of techniques. Although classical material is 
well represented, frequent reference is made to recent experimental 
data, such as those obtained in World War II. The limitations as well 
as the possibilities of various techniques are clearly indicated. Thus, for 
example, in the discussion of the limitations of personality question- 
naires it is excellently stated that the validity of such a questionnaire 
"will be a direct function of the honesty and accuracy with which the 
individual answers it" (p. 169). Hence the reader is prepared for the 
conclusion that personality inventories are far more valid in guidance 
or counseling than in selection situations. 

Despite its many virtues, however, this book shows a few unfor- 
tunate short-comings, many of which represent aon e omission. 
Certain topics, such as job evaluation, are m pede eint are 
such an omission is, curiously enough, characteristic © most textbooks 

Nor is any reference made to 


by ps ‘sts in tl industrial field. 
psychologists in the Men pm s ^ 
market survey techniques—an omission which is less serious since the 


authors evidently regard consumer research as outside the field of 
industrial psychology Another minor but important omission occurs 
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In the introductory chapter and in certain other places throughout 
the book the authors state a theoretical point of view which may be 
misinterpreted by those psychologists who are quick to applaud any 
deprecation of objective and rigorous formulation of psychological 
principles. Thus, the authors state that "interest should be centered 
upon the worker as a thinking, feeling, and desiring organism, and not 
as a mechanical device that responds in given ways to appropriate stim- 
uli" (p. 2). The intent of the authors would appear to be that of empha- 
sizing the error of a priori assumplions regarding the worker's motiva- 
tional stimulation, the importance of considering the motivation and 
attitudes of the individual worker, and the need for experimental investi- 
gation of such motivation and attitudes in specific working situations. 
Unfortunately, however, many readers with a mentalistic bias will mis- 
interpret such a statement, and will construe it as a denunciation of the 
applicability of S-R concepts—or even of the scientific approach—to 
the industrial field! The writer feels certain that Ghiselli and Brown 
would not subscribe to such an obscurantistic viewpoint. 

One further point deserves special mention. In reviewing the his- 
torical background of the applications of psychological and engineering 
techniques to production problems (cf. pp. 6-7, 14-16), the authors in- 
dict management—and indirectly the engineer and the industrial psy- 
chologist—for having in the past directed attention solely to the goal of 
increasing production and hence of increasing profits. It is pointed out, 
to be sure, that industrial psychologists have, in recent years, fortu- 
nately turned their attention to problems much broader than that of 
"production and profits." The authors fail to emphasize, however, that 
the services of competent consulting psychologists are available to labor as 
well as to management, and that the failure of labor unions to utilise such 
services cannot be laid at the door of either management or the consulting 
psychologist. Although this point is not made by the authors, there is 
a hint of it in a later section of the introductory chapter. In discussing 
the need for research under actual working, rather than artificial labo- 
ratory, conditions, the authors state that such an objective ‘would 
require that industrial and union organizations jointly provide the 
means for establishing research groups within the industrial organiza- 
tion" (p. 18). 

Everything considered, however, the merits of this book clearly out- 
weigh its shortcomings. It represents a real contribution to a field in 
which there are all too few good modern textbooks. 


Joun P. Forzvy, JR. 


Industrial Division, The Psychological Corporation. 


Burtt, Harotp E. Applied psychology. New York: Prentice-Hall, 
1948. Pp. x-+821. 


The advent of a new textbook whose author is a recognized pioneer 
in his field is always an interesting event. Most of n« are encaced in 
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podrta task of finding a text which best suits our needs and 
E T ES important, however, is the fact that such a book inevi- 
pr de ies a ees or at least a delineation, of a field. These 
a E s, Rum requently than those of the more concise and formal 
nm ae alleat mee the subsequent course of work in the de- 
TL a. This is particularly true if, as in this case, the author writes 

r students and laymen (perhaps actual or potential employers of ap- 


plied psychologists) alike. 

r It must be recognized, in short, that Dr. Burtt's book will have con- 
siderable influence upon what psychology instructors, their students 
and laymen think applied psychologists should do and how they should 
doit. Its evaluation in this respect, as well as a pedagogical instrument, 


therefore seems in order. 
The scope of this volume is wide. Apparently, the author conceives 


of applied psychology as à field which includes all efforts made by psy- 
chologists or other individuals to help anyone, regardless of whether 
the principles or methods of scientific psychology are involved. After 
an introduction and a quick tour of the field of general psychology, 
there follows à series of. chapters which inspire the reader with con- 
siderable respect for Dr. Burtt's breadth of interest and mastery of a 
voluminous literature. Three chapters are devoted to the contribution 
of psychology to education, two to personal efficiency and vocational 
guidance, three to medicine, d crime, two to employment, 
three to industrial efficiency and morale, four to advertising, and one 
to “outlying fields" which include “Life Enjoyment” (the arts, recrea- 
tion, and athletics); “Communications” (public speaking, writing, 
drama, movies, radio, television, an the press), and ‘Social Progress" 


(public opinion, politics, religion, War» and peace). . 
This implied definition ° applied psychology will probably be as 
disturbing to some readers as it is to this reviewer. These readers will 
wonder if perhaps d a re-definition which would 


the field does not nee t 
make it unnecessary f a textbook to include a treatise 
iatry, the problems o 
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that he be familiar with the reflex arc concept or the alleged effects of 
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ful attention to detailed procedures has, in certain instances, resulted 
in repetition or reduncancy, although this tendency has not unduly 
marred the effectiveness of the book. 

'The discussion of procedures is distinctly not of the ivory-tower 
variety, since recognition is constantly given to those practical situa- 
tions which may preclude a close hewing to a given theoretical line. 
For example, the authors make no bones about the acceding to the in- 
terests of either management or labor on the inclusion or exclusion of 
any given factor, in order to keep peace in the family. 

The most vulnerable spot in the book seems to be the acceptance 
of the point of view that the selection and weighting of factors must of 
necessity be made on the basis of the best “guesses” (the authors use 
the word '"judgments") of those responsible for the development of 
the system, relying on the past experience of other companies as à 
guide. It is entirely feasible for a company to determine factually the 
relative importance of the factors that are used in its system; the state- 
ment that "'there is no known method and no known criteria for deter- 
mining the relative values in an objective manner" seems just a bit 
strong in the light of the fact that such relative values have been deter- 
mined in specific situations. 

As a contribution to clear thinking in the field of job evaluation 
the book is of material significance. With full awareness of psycholo- 
gists' taboos about dealing with superlatives, it is the opinion of this 
reviewer that the book is the most adequate over-all manual on the 
philosophy and techniques of job evaluation that has yet appeared. 


C. H. LAWSHE. 
Purdue University. 


KAPLEN, Oscar J. (Ep.) Encyclopedia of vocational guidance. New York: 
Philosophical Library, 1948. Vol. I, pp. xxi-I-722. Vol. II, pp. 700. 
$18.50. 


Three hundred authorities in eleven countries contributed articles 
to this Encyclopedia. Topics which are covered include: vocational 
guidance activities throughout the world; vocational guidance periodi- 
cals and organizations; the handicapped; problems of minority groups: 
aptitude patterns in the principal occupations; occupational trends in 
the United States; economic aspects of guidance; techniques of inter- 
viewing: evaluation of guidance procedures; administrative aspects of 
vocational guidance; descriptive articles on the principal psychological 
tests; and other subjects. 

The editor states that this book ‘‘is encyclopedic only in a relative 
sense; it does not pretend to cover in great detail all phases of the guid- 
ance field. The topics included were chosen from a master list and many 
important subjects were omitted because of lack of space." The en- 


BOOK REVIEWS 75 


cyclopedia is designed primarily for professional counselors who d 
have large library resources. Ee 
Because economic aspects of vocational choice are so important 
articles have been prepared on occupational trends, unem loym c 
cost of preparation for the major professions, use of census data En 
average earnings. No attempt has been made to present inipcnation 
on the training requirements, economic outlook, or general nature of 


individual occupations. 


Topics are arranged alphabetically in formal encyclopedia style. 


For example, in the first 15 pages we have the following articles: ‘‘Ac- 
countancy, Aptitude For"; "Achievement Tests, General"; “ Achieve- 
ment Tests, Use of”; “Acrophobia”; “Alliance for Guidance of Rural 
Youth”; “American Council on Education Psychological Examination”; 


“Acting, Aptitude for.” 
It must be emphasized that these two volumes comprise reference 
material and are to be used for the purpose of acquiring pertinent in- 


formation quickly- It should be used in the same manner that a coun- 
selor uses the Dictionary of Occupational Titles—when a need arises. 
For example, if a counselor is called on to assist a blind person or a hard 
of hearing individual or à member of a minority group; à quick reading 


Dic beforehand will be of great value. 
Many of the articles are descriptions of principal psychological 
aring these presentations with the 
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i inating counselor will, I am sure, find the 


ments Yearbook. Th é re, 
Mental Measurement helpful tool in deciding what 
tests to order or to use. Evidently, Dr. Kaplan has a conflict about in- 
cluding material on tests. An c f Vocational Guidance 

could not well omit such information; On the other hand, the Mental 
ption and critical comments. 
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one using tests $ 

little value :tical luations of classes of t 
. " ests 
: merit are the critica evaluations of € » sts, 
Articles d s Critical Evaluation of the Principle and ‘‘Clerical 
such as, “Art tests, À " Many critical comments on specific 
summaries. From the user's standpoint it 
find complete information on a 


Aptitude Tests; 
articles to 


tests will be fou 
means reading at ! 
given test. 
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The physical make-up of the books causes annovance. There is no 
indication that Vol. I contains topics A through L and that volume II 
starts with M. Within a volume it is difficult to find a given title be- 
cause of the lack of any indexing. Appropriate letters on the corners of 
the pages or thumb-nail indexing like a dictionary would facilitate 
quick reference. : . E _ ran 

Despite the objections mentioned the Encyclopedia isa contribution 
to the field of guidance. Along with the Dictionary of Occupational 
Titles and Buros's Mental Measurements Yearbook it should have its 
place on the bookshelf of every vocational counselor. 

STANLEY G. Dursky. 

Chicago Psychological Institute. 


NAUMBURG, MARGARET. Studies of "free" art expression of behavior 
problem children and adolescents as a means of diagnosis and therapy. 


New York: Nervous and Mental Disease Monographs, 1947, No. 
71. Pp. xi4-225. 


This monograph is a collection of six articles published by the author 
in various psychiatric journals and in a volume on Child Psychiatry 
during the period between April, 1944 and January, 1946. All six 
studies are part of a research project at the New York State P 
Institute and Hospital and are devoted to “the investigation of the 
possible use of 'free' or spontaneous art expression as an aid in both 
diagnosis and therapy" (iii). Each study demonstrates the extent and 
nature of the fulfillment of the project's aim by giving a detailed analy- 
sis of one child's graphic productions, conversations and general be- 
havior during his periodic art sessions with the author. 

. By means of numerous reproductions Miss Naumburg shows con- 
vincingly how impersonal routine tracing and copying, common among 
public school children, gradually is replaced by spontaneous expression 
of personally significant subject matter: “As soon as original art work 
in encouraged, instead of dependence on models and Specific techniques, 
the focus of a patient's art activity is modified. He will begin to draw on 
his own inner resources and this will inevitably lead to some expression 
of the conflicts within the personality, which may reveal aspects of the 
pattern of his mental disease as well as the specific insecurities or trau- 
matic experiences within the patient" (p. 50). By releasing these 
largely hidden forces the author feels that the art productions of her 
patients serve the dual purpose of aiding the psychiatrist in diagnosis 
and in therapy. Whereas in a setting of this type both functions are 
more or less interwoven, the former goal is served primarily by bringing 
to the fore hitherto unverbalized diagnostic material, thus corroborat- 
ing, anticipating or broadening psychiatric findings. The therapeutic 
contribution is in form of catharsis, substitute satisfaction for sup- 


sychiatric 
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n ree . 
pressed desires and strengthening of self-esteem through artistic ac 


complishments. 
Although the author's main purpose is to demonstrate the value of 


free i i 
art expressi inic i i 
pression 1n clinical practice, occasionally she reaches out be- 


rn rca meii ln etd eni 
SP exu iridis ance é as a language of communication in the life 
at every individual, whether he happens to be mentally disturbed 

at peace with himself” (p. 89). In. fact, in one place we find the d 
reaching assertion: “Tn the East unstinting recognition has always test 
accorded to art as an expression of and not as an escape from reality; 
could the West bring itself to an acceptance of all forms of creative 
expression as a universal normal and integrating experience that is 
neither effeminate not neurotic, our culture might again find ways to 
restore harmony and balance to the disequilibrium of the modern 
psyche” (p. 90). Whether or not we follow Miss Naumburg all the way 
in these contentions, it cannot be denied that creative activity has a 
cathartic as well as constructively fulfilling effect. Any efforts to en- 
courage its broader use in such fields as education, recreation or therapy 
can therefore only be welcomed. 


So much for the main theme of thi ically 
terested reader will surely find it stimulating and worth looking into. 


Those who wish to try out the technique discussed in it will appreciate 
both the specific sugges 's actual procedure of 


tions regarding t ; A 
starting the process of “liberating throu 1l as illustrations 
of her use of V ic expressions. 


erbalizations concurre 
A person with wide 


r interest, 

amount of disappointment a 
that some attempt > 
rich and suggestive material for an advancement of our ca nape 

r : i i 
of general theoretical : eaa ularly the 
ception, the psychological n n eaj e * 
last study in the volume shou elf well to theore n = ra- 
tion. It should be remembered, ; f Miss aum- 
burg's study was clearly st land not theoretical in nature 
and we all know how ifficult € Pap od 
divergent aims V e framework of the sam s rch project. he 
theoretical potenti of this materia suggested, there- 
fore, as a realm for further exP 
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s monograph. The clinically in- 
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to book form with- 
for a brief preface 
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à us unification ° organization except r 
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out any attempt at 
by the author anda 
generalizations are repeated i 
where a long pa$5 


ord by Nolan Lewis 5 y 
There is even one instance 
ted verbatim in another, 
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when a mere page reference would have sufficed. In addition to this 
repetitiousness among the separate articles, lengthy descriptions of the 
reproduced pictures appear in practically identical form in both the 
text and the caption of the picture. In short, greater editorial thorough- 
ness would have much enhanced the readability of this worthwhile 
report. 
i MARIA A. RICKERS-OVSIANKINA. 
Wheaton College. 


LINNDER, ROBERT M., AND SELIGER, Ropert V. (Eps.) Handbook of 
correctional psychology. New York: Philosophical Library, 1947. Pp. 
691. 


The kindest thing that can be said about this volume is that is a 
pioneering venture in an area that has been very inadequately explored. 
Although the authors titled the book “correctional psychology," and 
the inference is that by “correctional” is meant therapy in the penal 
sense, an analysis of the articles reveals less than one-quarter would 
not fit primarily into pure medicine, physiology in its applied phases, 
or sociology, especially in its forensic aspects. 

For example, the reviewer was surprised to find chapters such as: 
“Electroencephalography: used in penologic practice”; "Sugar metab- 
olism in its relations to criminology”; “The intramural practice of eye, 
ear, nose and throat" ; "The acute medical patient"; “ A venereal disease 
program in prison"; “Use of sedative measures in mental hospitals"; 
""Treatment of traffic offenders." While each and every one of these 
and other chapters can be argued for in a volumeon criminal psychology, 
to say that the sum of articles such as these are "psychology" is to 
stretch the semantics a bit! 

The bug-a-boo of all editors of compilations—variations in style and 
quality is well evident here. Some articles, like Brancale's “The classi- 
ors mane ina wide m institution" and Selling's “The extra- 
institutional treatment x "ar E 
logically within the premier e rc «apad adr 

l r ; ake the book valuable for 
those in the field, while other articles seem either so peurile, forced, or 
down-right silly that they make the reader wonder why they are in- 
cluded in a serious volume apparently designed for the use of specialists. 

To whom is the volume addressed? It is too specialized to be used 
as a text in criminology; it is too superficial for the use of specialists in 
any area in penology; it is the source of too many original articles to 
be passed by. It is probably best suitable for a text for people in in- 
service courses in prisons who need to know the functions and problems 
of associated disciplines. 

The need for a book that will discuss the ori 


$ gins, dynamics, therapies 
for criminal behavior has not been met. The p 


resent volume is not well- 
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er is the field of penology th z 
that the editors were unable to find a de aiey g fea x 
Psychological Association who was employed in the penal area Hi 
into this collaboration of 46 minds is a sad commentary on ti acis 
siata of penal psychology- p 
he atrocious technical aspects of this volum 
The very first word of the book is donde rata e of 
Editors'. We learn there isa test known as "Revised re bel ast 3 
In the Table of contents" the titles of the chapters are not ed 
with the names of the authors. Some chapters have pierra fie 
some do not. One bibliography is about 300 pages out of Mack. Phare 
is no index. The articles are not sectioned off in any logical manner 
For the price, a more substantial binding would have been e" regle f 
In sum, it would be fair to say that there are various chapters that í 


and are unique and for these chapters the book is well 


are well written 
] that while the assembly job seems sloppy that there is 


worthwhile; anc 
i e volume to recommend it for the person interested 


enough gold in th 
in correctional psychology: 
R. J. CORSINI. 


balanced, but then neith 


San Quentin. 
SYLVIA. American opinion 


R., AND EBERHART, 
on: Princeton Univ. Press, 


COTTRELL, LEONARD S.J 
tomic age. Princet 


on world a, airs in the a 


atest scientific achievements, and perhaps the most 
orld War II, was the atomic bom 

. This and the 

asaki were primarily ‘ple for bringing 

i 's understanding of 


out, it was n 
but rather the experience of 
dread of many more that was 


effective. 
Atomic scientists have pointed out that nuclear knowledge of this 
kind cannot forever that no adequate defenses are 
known or can be vised. Itis also conceded that the use of atom bombs 
in another War c ilizati s we know it. Be- 
mity of the implications of these new scientific develop- 
ments, plans Wer lated immediately after the war for the develop- 
energy for peaceful uses. One of the tests 
tion Crossroads," 


ment and control of atomic k 
known as Opera 
an's choice of the use of 


significantly named—man 
tional basis, the decision 


One of the gre 
powerful psyc 
which was drop 
later one dropped 
end, and openec 
h t surrender, 
bomb will do, pls the 


may well have been 
atomic energy p 
would seem clear. 
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As a psychological factor, the atom bomb had an indisputable effect 
on the whole world. All the old concepts of war were changed overnight, 
and a covert fear gripped people of all nations, including those of the 
United States, which was at that time the only country in possession of 
atom bombs, or of the ‘‘secret’”’ of producing them. The existence of the 
bomb imposes a great moral obligation on people throughout the world 
to outlaw its use for war; and the means of its ultimate control rests 
with a strong international organization. 

This is certainly the view of most physicists and atomic scientists, 
and it was not long after the war before organizations were formed such 
as the Emergency Committee of Atomic Scientists, and the Association 
of Education for Atomic Energy. 

The present book by two social scientists is based on a report pre- 
pared for the Committee on Social and Economic Aspects of Atomic 
Energy of the Social Science Research Council. It is primarily a sum- 
mary of a study made in 1946 of American attitudes regarding foreign 
affairs, especially in relation to atomic energy. 

The most interesting thing psycholo 
ignorance of large segments of our popul 
for example, one-third of the people interviewed could not give even 
the simplest answer to the question, “What is the main thing the United 
Nations organization is set up to do?”’—but rather the contradictions 
in attitudes. For example, the majority of people interviewed indicated 
that they approved the idea of going beyond the idea of a United Na- 
tions to a world government, but at the same time objected to lowering 
tariff barriers. Likewise, most people were veering away from the notion 
of isolationism, but at the same time did not think it wise to give or 
lend money to people of other nations. Of course, contradictory atti- 


tudes of which these are illustrations are well known to social scientists 
and others who have carried out attitude studies, 


Regarding the atom bom b, most people apparently felt so completely 
helpless regarding its potential damage that in one sense it had almost 
ceased to have reality for them. Their attitude was largely one of 

let the government worry." In fact, this Same attitude of preferring 


to leave the entire problem up to the “leaders,” the “experts,” or the 
government" has been illustrated in 


Í | Tecent studies reported by the 
Society for the Psychological Study of Social Issues. The main lesson 
in all of this is the enormous psychological distance bet 
of our democracy 


l and the world issues that concern 
appointed leaders. 


The present volume is presented in fair] 
a great deal of tabular material. 
popular consumption. An index a 


gically is not simply the vast 
ation regarding world affairs— 


y technical form, including 
Apparently it was not written for 
t the end would have been heplful. 


STEUART HENDERSON BRITT. 
McCann-Erickson, Inc., New York City. 
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sis. New York: Grune & Stratton, 
II, pp. vii 4-513. 

ed with the principles and practice 
f this work suggests material from 


Wo bere, L. R. Medical hypno 
1948. Vol. I, pp- xi 4-449; Vol. 
These two volumes are concern 


of hypnotherapy- While the title o 
the general field of medicine, its contents indicate that the author's 


main concern is with the field of psychopathology. It may be stated at 
the outset that Wolberg does not consider hypnosis as the ultimate in 
therapy, but rather as a useful and legitimate tool, which like any other 
therapeutic procedure, has its indications and contraindications. 
Volume I includes brief but adequate discussions of the traditional 
topics in hypnosis, i.e. history, phenomena, tests, technics, theories. 
The greater part of the first volume deals with the direct and adjuvant 


use of hypnosis in such varied disorders as alcoholism, psychosis, con- 
neurosis and psychosomatic conditions. In 


version hysteria, anxicty 

addition there is a brief statement of the possible use of hypnosis in 

such miscellaneous conditions as overweight, insomnia, nail biting, 
the discussion of therapy 


ve smoking. Preceding 


f the above conditions is an analys i 
Direct and complete recording of patient-doctor con- 


e cases makes up the bulk of the second volume. The 
histories comprises 85 pages: The cases illustrate the 

val, psychobiologic therapy (guidance, 
ication) and psychoanalytic therapy, 
but the latter case involves, 
nt symptom-free but 
Justification, there- 
al or psychobiologic 
of certain aspects of 
the role of the symptoms in the total 
certain situational factors 
utic goal considered feasi- 


stage fright and excessi 
is of the personality dy- 


in each o 
namics involved. 
versation in thre 
shortest of these 
use of hypnosis in Sy 
reassurance, desensi 
respectively. 
Wolberg warns, the risk à 
without a full understanding of his inner conflicts. 
fore, for the use o is i ymptom remov 
therapy is to be fou 
the personality (ego 
personality, motivati 
(money and time available), anc 


ble. . 
ion of therapeutic goals is too oft | 
d ey ussion of this and the related topic of 


that Iberg, b his tho A pi: 

bd deri zes his most pertinent contribution. Also of 

interest is the author's theory of hy nd his observations concern- 
i d narcosynthesis. In addi- 


ing the relative effectiveness of hypnosis an 
tion, both volumes contain edural suggestions relating to 
, 


a patie 


| the therape 


ften ignored and it is here 


f the trance, eliciting of hyp- 


technics of trance indue » 

i a and se ypnosis- 

notic phenome of this reviewer concerns the tendency of the 
erning such problems as 


The chief criticism } 
author to explain away the discrepant data concer prol 
changed sensory limen, individual differences 1n susceptibility, and 
validity of regressio in terms of aptitudes. These aptitudes are said to 

y the interplay and balance of both conscious and un- 


be determine 
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conscious strivings within the individual. The subject who is hypno- 
tized "against his will' or by a disguise technique is thought to be 
strongly motivated toward susceptibility on the unconscious level, 
while in the cooperative but nonsusceptible person, the reverse situation 
would hold. The data supporting this hypothesis, however, are of a post 
hoc nature and the hypothesis itself is not easily subjected to experi- 
mental verification. Another objection is Wolberg's use of Rorschach 
test results in support of the reality of regression. This would seem to 
assume the existence of well standardized age norms. 

The final chapter on the future of hypnosis is both brief and weak. 
The author argues that in the interests of both medical and lay groups, 
safeguards (i.e. laws) are required which would restrict the use of hyp- 
notherapy to members of the medical profession. There are many 
objections to this conclusion. Not the least important of these is the 
fact that, as the author's own references indicate, many individuals not 
privileged to belong to the ranks of the medical profession have made 
contributions both directly and indirectly to the field of hypnotherapy. 

In these two volumes, Wolberg has presented a well written and in- 
teresting work which makes a definite contribution to the field of hyp- 
nosis. These books may be profitably used as reading material in ad- 
vanced courses in abnormal and clinical psychology. 


F. L. Marcuse. 
Cornell University. SES 


HALL, Victor E. (Ep.). Annual review of physiology. Stanford, Calif.: 
Annual Reviews, Inc., 1948. Pp. xi+552. 


For the psychologist not intimately concerned with specific physi- 
ological processes, there is relatively little of value in the volume. 
Of the 23 chapters, only five are oriented toward problems with which 
the profession commonly deals. These five are: (1) Vision, by A. Chap- 
anis, 24 pages; (2) Physiological Psychology, by W. H. Gantt 26 pages; 
(3) Heat and Cold, by D. H. K. Lee, 22 pages; (44) Anoxia in Aviation, 
by L. F. Nims, 10 pages; (5) Somatic Functions of the Central Nervous 
System, xi w S. McCulloch and J. Y. Lettvin, 16 pages. The re- 
maining chapters contain isolated i ; ^ 
wq poe c croise ; relevant studies for the psycholo 

Gantt's contribution on physiological ps [c i imi 
by the author to the conira cl his oe poen dg e 
lovian methodology and human psychopathology. However ise those 
similarly occupied, the references to current Russian ex erimentatio® 
and the expostulation of the need for a “level of icles ae a com 
to psychobiology are rewarding. Psychologists need the e 
which comes from such considerations of integrative levels as are pre- 
sented in Novikov's paper in Science (March 2, 1945, 101, 209-215). 
The reviewer found Chapanis' chapter a concise and worthwhile 
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salient new studies on color vision. Considering the mass 


survey of the 
he war, it is an achievement to have isolated 


of material emerging from t 


the NUR studies as Chapanis has done 
e , 1 i j 
McCulloch's review 1$ concerned mainly with the research done in 


arc atory and related areas. Most of the references cited at the 
vise 1e chapter are not described in the text. For those dealing witl 
ie phy siological substrata of oxygen deprivation, Nims’ brief s . 
will be of particular interest. It would have been valuable to [omis 
lated the high altitude effects of anoxia with those reported omg xdi 
psychosomatic therapies as insulin and electroshock. Lee's eee hs 

is an effective presentation of the environmental pressures arising pes 
exposure to heat and cold. Many of the data offered are rarely cited in 


psychological literature. 
It is to be hoped that the editors of the forthcoming first “Annual 
Review of Psychology" will follow the example of the editors of this 
volume in their attempt to ensure author representation from all 
parts of the world regardless of "curtain effects." 

BERNARD F. RIESS. 


Hunter College. 
ran nature and their 


ner. The driving forces of hun 
y and psychopathology 


Moore, Tuomas VER 
adjustment: a introduction to the psycholog 
of emotional behavior and volitional control. New York: Grune & 
Stratton, 1948. Pp. y+456. $6.50. 
The reviewer hazard that the late Father Moore of the 
Catholic University of America was emporary psychologist 
(as the title-page de- 


who was not only Ph.D. and an Mw 
ict." Since stereotypes Vic- 
hose ignorant of the 


clares) a “monk of th order of St- 
timize those who “know abou them as well as t r 
ofessional colleagues may be wrongly disposed to 
tement of Thomistic doctrine. 
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view this volume e pe E € 
Actually, it i$ an original an powerfu syn s x 
eee clinical psychiatry with “the Perennial Philosophy: ! 
rts to this treatise which is pitched on an inter- 
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modernity is reve € LET. 
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Part II on the Emotional Life centers about a thorough dempiion id b 
Lange-James theory. The author holds that affections are as hig psychic 
as organic and that "the human being is a psychosomatic unit substance. 
“Faculties” are accepted as real entities on two grounds: (1) the necessary 
existence of mental "functions" to account for “products, and (2) the findings 
of factor analysis pointing to independent or distinguishable mental processes. 

The fourth part is a rich treatment of representative psychopathological 
cases grouped under conventional categories. Here Moore's vast and solid 
learning, as evidenced by the extensive French, German, and Spanish foot- 
notes, and his broad experience in "helping people in trouble" are 
features. . 

The heart of this work appears in Part V which bears the same title as the 
book itself. It outlines a system of motivation with these emphases: “There are 
just as many impulses as there are human abilities"; “the criterion of worth in 
evaluating desires is not pleasure but accomplishment”; self-management 
requires a hierarchy of desires with one supreme end of life to which everything 
else must conform; the self-ideal is an ideo-motor concept; “religion is the nat- 
ural sublimation of human desires, always possible and always effective, no 
matter how great the calamities that confront us”; man has many drives and 


not one. The twin chapters on the “parataxes” of depression and anxiety in 
this section are little works of art. 


In Part VI the “Will” (with a capital W) is rehabilitated by arguing that 
if there is an admitted function of control, there must be a volitional faculty, 
which in turn is essentially a partof the “soul,” or total organizing principle of 
man. Actually, this theory,” while stressed, receives less space than extended 
analyses of experiments on motor response wherein Moore is thoroughly at 
home. 

The seventh and concluding part is devoted to mental hygiene principles and 
a related philosophy of nature and society. There theology seems to win the 
upper hand, for the tone is basically that of a psychology of religion from the 
believer’s standpoint. 


prominent 


Few books are harder to challenge than this one since it seems to 
be unusually free of factual errors. 


1 | r Certain judgments, of course, may 
be questioned, such as his belief that cyclothymia is transmitted in the 
genes, or the opinion that most of t 


hee 1 ‘ he “whining evaders of responsibil- 
ity" he encountered in hospital wards had grown up without altruistic 


urges. One may also wonder whet here is not something spurious 


or at least weak in the ( ?) presuppositions behind his 
clever caricature of the literal superficiality of Wundtianism (or any 
other form of psychological positivism), viz: “There is a flow of con- 
sciousness, but what flows? Nothing. There is sensation and perception, 
but what perceives? Nothing. 

Nobody. There is action, but nothi 


her t 
metaphysical 


There is choice, but who chooses? 


Mice 
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omission, too, in an earne ientifi 

a a st scientific test i 
oe to Langan i ament of excepti i 
e absence of any consideration of the cultural aida vob de : 
proach to personality structure. PREPRE P 

Any one who finishes hi i 

y :s his perusal with tl <peri 
j $ À n he experience of “ 
thou persuadest me must acknowledge that this is a serious =a 
portant contribution to systematic psychology, of potential b efi ed 
wider audience than it will probably have. modam 
GroncE W. H 

B B . i AR 

Teachers College, Columbia University. ii 


A. T. (Ep.) James McKeen Cattell: man of science. 


PorrENBERGER, 
: Science Press, 1947. Vol. I, pp. viii +582; 


1860—1944. Lancaster, Pa. 
Vol. II, pp- viii +503. 
M eter] oem pem spine aie vega nina 
l a a , tea and apostle of academic 
ing together in convenient form seventy of the more 
rs of James McKeen Cattell (1860-1944). The first 
i f his scientific papers in which original 
lume are forty-one papers, 
opular articles. Each volume has a short 
reciation of Cattell, in Volume 
F. L. Wells. 
hich these seventy papers 
d in the back of Volume 
k and constitute the 


outstanding pape 
volume contains twenty 
researches arc reported. 
mostly addresses and semi-p 
foreword by the editor followed by an apP 
Iby R. S. Woodworth and in Volume II by 

Cattell’s complete bibliography, from W 


were selected, contains 167 titles which are liste 
f research wor 


I. Of these 30 or more are reports O 
bulk of Volume I Those selected for the contents of Volume Il are 


largely addresses, book reviews, notes and comments, most of whic 

appeared in Science or The Scientific Monthly. Only two books are listed 
in the bibliography — University Control, published in 1913, and Car- 
negie Pensions, in 1919. Cattell believed that the best way to dissem- 
inate scientific information is not through books but by short papers in 


journals. 


In the second vo 


for Volume 


scientific and popular . 
The editor does not state the basis on which the 41 papers 1 
II were selected. It appears, however, that they are representative 

dvancement of science, its usefulness 


freedom and democ- 
in Volume II are on 
are on topics con- 
one quarter on the 
universities in the 


terests in the a ) 
rn for academic 


of Cattell's broad in 
in daily life, and in 
racy. More than h 

psychology and its applications; about one quarter 
cerning the advancement © i eneral; and 
organization and administrat and 


United States. 
rs are arranged 
In view of 


in chronological order and 
the assertion by the 


, In each volume 

with no editorial change or commen he as 

editor that these papers have more than a mere historical value, one 
i nt was not employed. With the ex- 


wonders why a topical arrangeme 


86 BOOK REVIEWS 


ception of two short papers (and possibly one other) the contents of 
Volume I could have been arranged under four main topics representa- 
tive of the four major areas in which Cattell did his important research. 
'These are: 


1. Studies of reaction times, including measurements of the time taken up by 
the inertia of receptors, by sensation, perception, association, recognition, etc. 
At least ten of the papers in Volume I are reports of experiments in this general 
problem area. 

2. Studies in psychophysics, particularly his famous work on the perception 
of small differences and measurement of errors of observation. The longest 
paper in Volume I—more than 100 pages—is his famous study, “On the per- 
ception of small differences,” done in collaboration with Professor G. S. Fuller- 
ton of the University of Pennsylvania and published in 1892. 

3. The development of mental tests and the measurement of individual differ- 
ences. There are two papers on this topic in Volume I and three in Volume II. 

4. The development of the order of merit method and its applications in his 
famous work on men of science. Ten papers comprising most of the last 200 
pages of Volume I are on this general topic. 


The papers in Volume II could have been classified as follows: 


1. Those in which Cattell discusses technical problems in psychology, as for 
example, the papers on “The time it takes to think," "Reactions and percep- 
tions," “The perception of light" and "Mental measurement." 

2. Those in which he states his general viewpoint in psychology and his con- 
ceptions of psychological problems. For example, “The conceptions and meth- 
ods of psychology." 

3. Papers on the applications of psychology to practical problems, as for 
example, "Practical psychology," The usefulness of psychology,” “Psycho- 
logical methods to promote highway safety.” 

4. Papers on the organization of scientists and the advancement of science. 

5. His extensive studies on the organization and control of American colleges 
and universities, including two or three papers on general education in relation 
to science and democracy. . 

6. And finally, two or three papers on wider social issues—democracy and 
international good will. 

This outline will give the reader a general idea of the contents of 
these volumes. Psychologists will find in Volume I not only the original 
data of Cattell's early experiments but very stimulating discussions © 
problems and methods that are quite relevant to modern experimenta 
work. This volume should and undoubtedly will appear on the reading 
lists for graduate students of psychology. It is a classic that will brighte? 
with age. Cattell did as much as any contemporary psychologist to put 
psychology on a sound scientific basis and far more than any other man 
to establish it as a respected member of the growing family of sciences: 
He was the first man in the world to hold the title ‘Professor of PSY- 

chology” and the first psychologist to be admitted to the Nationa 
Academy of Sciences. 

Although Cattell never attempted to found a school of psychology 
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7 or to formulate and promote à conceptual system, he none the less held 
rigidly to a behavioral point of view and regarded psychology as a 
growing and important science of behavior. "I have always held that 
psychology has to do with the conduct of an individual rather than with 
his consciousness. We are concerned with what a person does, rather 
than with what he thinks he thinks, feels he feels or imagines he imag- 
ines. In psychological experiments, as I once put it, ‘Tt is usually no 
more necessary for the subject to be a psychologist than it is for the 


vivisected frog to bea physiologist” (IL, p- 371). 
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children to call her doll Mr. President, on the esoteric ground that he 
would lie in any position in which he was placed. Of course, the presi- 
dent is by nature as truthful, honorable and kind as the rest of us, and 
is as likely to have more ability or enterprise, or both. But he really 
finds himself in an impossible situation" (II, p. 284). He goes on to 
propose certain reforms in university control which in his judgment 
would clear the way for the progress of science and the welfare of human- 
ity. 

The subtitle of the two volumes is “Man of Science." Cattell was 
all of this and much more. He was a great protagonist of science. 
Science to him was the most vital thing in life, amounting almost to a 
religion. Scattered through his writings, even in his most technical 
papers, are numerous references to the great importance of science for 
civilization and human happiness. It is the basis of democratic society ; 
the discovery of truth that sets men free. It has freed them from toil 
and drudgery, from human slavery and child labor, from physical disease 
and pain, from torturing fears, and from ‘‘jealousies, resentments, and 
passions that lead to war." “Science has not only created our civiliza- 
tion: it has given to it the finest art and the truest faith” (II, p. 406). 

Cattell was not a science-for-science's-sake man. For him science was 
not an indoor sport to be enjoyed by a chosen few. He regarded it as a 
powerful instrument of human welfare. His interest in applied psy- 
chology and his organization of The Psychological Corporation, to 
which he devoted the last years of his active life, bear eloquent testi- 
mony to this conviction. A characteristic statement of this belicf is, 
«|, it appears to me that the conditions are most healthful when 
science and its applications proceed hand in hand, as is now the case 
in engineering, electricity, chemistry, medicine, etc. If I did not believe 
that psychology affected conduct and could be applied in useful ways, 
I should regard my occupation as nearer to that of the professional 
chess-player or sword swallower than to that of the engineer or scien- 
tific physician" (II, p. 206). 

There is an interesting parallel between the careers of Cattell and 
Benjamin Franklin. Both were interested in the advancement of science 
and particularly in its application to the practical problems of life. 
Both were journalists and devoted to the wide dissemination of knowl- 

edge. Each was the founder of a scientific society. Both were socially 
minded citizens: strong believers in the democratic process and bitter 
opponents of tyranny and regimentation, particularly in matters of 
scientific inquiry. There are, of course, important differences in the 
careers of these two great Americans, one of which may be changed as 
time goes on. The name of James McKeen Cattell may some day be as 
well known throughout the world as that of Benjamin Franklin. 


Marx A. May. 


Institute of Human Relations, Yale University. 
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GORER, Grorrnkv. The American People. A study in national character 
New York: Norton, 1948. Pp. 246. i 

The scientific approach to the study of national character is beset by 
three major problems: what do we want to know, how are we going to 
get it, and how shall we interpret what we get? Cultural anthropolo- 
gists interested in describing and explaining the general contours of 
small societies and their cultures have been able to overcome in some 
measure these problems faced by the scientist in the study of large, often 
heterogeneous, national groups. For, almost in one bold swoop, it has 
sible for the culturologist to survey the social struc- 
ture of a small tribe, describe its institutions and how they interlock 
with one another, make predictions as to how these institutions must 
inevitably affect the individuals involved as individuals and as a collec- 
tive, and then to give case histories of a selected sample in order to 
show that the over-all description, explanation, and prediction square 
with the case material, 1-¢-, with how the individuals actually think, 
feel, and act. 'These studies have for the most part approached the 
with psychoanalytic orientation so that not only are the as- 
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tions, one gets the feeling from the heart that not only are forty million 
Americans neglected in the study but that it is a study tailored-to-fit 
the "Anglo-Saxon" section whence the basic character of America 1s 
supposedly derived. This, of course, is a supposition which is taken alto- 
gether too much for granted by Gorer and can easily stand further in- 
vestigation. 

The study is largely impressionistic and based on Gorer's travels 
throughout the United States as well as his discussions with various 
social scientists and his "love and friendships and quarrels and mis- 
understandings and delicate negotiations and casual incidents which 
made up my life in the United States" during a period of seven years. 
Among a host of unrelated, though interesting, impressions and facts 
thus gained, Gorer presents a simple thesis which is supposed to ‘‘ex- 
plain" what gives Americans their national character. 

The essence of the thesis is not new; it is merely put in different 
words than others have done, especially Margaret Mead, an intellectual 
debt which he carefully acknowledges. In brief, we are told that the 
fundamental characteristic of Americans is their hatred of authority, 
and that this hatred stems from the fact that America, like no other 
country,? is a haven for immigrants, or at least has been until compara- 
tively recently. First generation immigrants are depicted as having à 
strong hatred for their respective oppressors and having great hopes of 
adapting as best as possible to the new freedom of America, especially 
in the upbringing of the second generation as more perfect Americans 
than they are, in language, manners, appearance, aims, and values. 
“Tt is this break of continuity between the immigrants of the first gen- 
eration and their children of the second generation which is to my mind 
of major importance in the development of the modern American char- 
acter, which gave rise to what might be called by analogy with genetics, 
the American mutation. ... The individual rejection of the European 
father as a model and a moral authority . . . was given significance and 
emphasis by its similarity to the rejection of England by which America 
became an independent nation" (pp. 26-27). 

There can be no doubt that there is a great deal of truth in this 
theory of "Europe and the rejected father" as a basis for the national 
character of America, but when it is made to answer for almost every 

characteristic, it falls far short of the mark, as should be expected when 
one or two explanations are given for a set of infinitely complex and 
pluralistic social phenomena. When, for instance, we are told that it was 
only ''the less assimilated and less assured groups” who loved Roosevelt 
like a father, and that the remainder had an "almost pathological 
hatred" for his authority, then it would seem that Dr. Gorer is over- 
dramatizing the situation in order to fitit to his theory. It seems to this 


2 Modern Palestine might be an exception to this statement, and to a lesser degree: 
Canada. 
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reviewer that an economist might more readily explain, and rightly so, 
the hatred for Roosevelt in terms of his position vis-à-vis the vested 
interests, and the love for him by the vast mass of the population pre- 
cisely in terms of his authority in dealing fearlessly with these vested 
interests. 

The crucial test of this theory is perhaps in the sphere of religion. 
While religious matters do not occupy the population nearly as much 
as in other countries, they are nevertheless, for the most part, God- 
fearing people endeavouring to do what is "right in the eyes of the 
Lord." "In God We Trust" is a common expression in the U. S 
symbolizing that the American creed of “Life, Liberty, and the pursuit 
of Happiness" shall be guaranteed to all under God and under those 
whose position it is to administer this maximum bonum. Indeed, hatred 
of authority is manifested everywhere in the U. S., but it is in regard to 
interpersonal relations and not so much to all positions of authority in 
the land. It is perhaps because Gorer was not in à position to treat the 
religious question at all, as he points out 1n his Foreword, that he re 
the significance of the "hatred of authority. It is s this paon t gt 
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by Gorer, the theory of ''mommism" may have its place in the consult- 
ing room or the clinic, but cannot, without being ridiculous, explain 
everything from the nursery and the school to the business office and 
the White House. 

A similar criticism could be pointed out when Gorer claims that the 
"frequently expressed fears that America will be reduced to want, per- 
haps to actual starvation, if it lets its foods or resources or money out- 
side the country," are due to the predetermined schedule-feeding of 
infants rather than on the basis of sclf-demand. Even if the “cause” 
for such occasional outbursts could be traced to the anxieties created by 
withholding the breast from the infant, it certainly does not jibe with 
the well-known fact of America's generosity to less-fortunate peoples, 
e.g., the aid given after the Japanese earthquake of 1923, the continu- 
ous sending of food parcels and moncy by all sections of the population 
to relatives overseas, and more recently the vast European Recovery 
Plan. "The great addiction of most American men to milk as a drink” 
as well as the "great erotic fetishistic value given to women's breasts” 
are also taken to be due to this early "anxiety." What is overlooked is 
that not only do they drink much milk but cat and drink many other 
things in greater quantities than elsewhere because the standard of living 
is higher; and further, with respect to the other claim, it has not yet 
been proven that European men are to any degree less erotically ex- 
cited by women's breasts than American men. 

On the question of “ideology” in the American character, Gorer does 
not have much to say, except to point out that it is “a minor compo- 
nent." While this assertion may be quite true, it seems likewise probable 
that it is true for the national character of most other peoples.3 On the 
other hand, Northrup, Myrdal, and others have found a great deal of 
ideologism in the American character, so that to say that “ideology is a 
minor component of Americanism” is at best a controversial contention. 
Furthermore, some studies in the Psychological Barometer series would 
tend to show that Americans have pretty strong ideas (whether right or 
wrong is beside the point) regarding Americanism which guide their 
conduct, at least to some extent.4 

Perhaps the most important contribution of the book is its novel 
analysis in bipolar terms of certain typical relationships: child to father 
child to mother, parents to children, husband and wife, and so on. 
Though this type of analysis was first suggested by Gregory Bateson, 
it is the first time as far as this reviewer is aware, that it has been a5 
extensively applied in a study of American national character. In the 


8 Possible exceptions might be found among the French, Russians, and Israeli. 
* Cf. for example, Henry C. Link, What does Americanism mean to the American 
people? J. appl. Psychol., 1947, 31, 425-430. 


we 


BOOK REVIEWS 93 


themal analysis of the various relationships he has chosen to study, 


Gorer gives us both an excellent structural and dynamic picture. Amer- 


icans step out of print and somehow seem more real than they do in the 


rest of the book. It is a pity this analysis was not extended further. 

In fine, notwithstanding the many criticisms that could be levelled 
at the book, Gorer deserves the thanks of all those who are interested 
in finding out what makes one people be what they are as compared to 
another people. Anyone who has thought and worked on this problem 
is well-ware of the difficulties involved in making generalizations which 
could easily be accepted by his co-workers. Like Newton’s third law, 
to every generalization in national character there seems to be an equal 
and opposite gencralization. This virgin field is wide-open for research 
and anyone who seriously attempts to close the wide gaps existing 1n our 
knowledge is making a big contribution. But a note of warning seems 
in place: analysis in depth (e.g.; psychological interviewing, description, 
and analysis) can serve a very useful purpose only if it is then related to 
an analysis in breath (e.g., macroanalytic analysis of institutions and 
social structure as well as systematic polling and attitude studies) and if 
both are seen in the light of historical and contemporary social, politi- 
cal, and economic dynamics. Each approach separately tells only part 
of the story. Only an interlocked aggregate of the guum pe can 
give us the correct picture. This is a hard order but, in a world suffering 


greatly from international misunderstanding, the results would be worth 
the eff 
ort. Percy BLACK. 
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THE PREVOST-FECHNER-BENHAM SUBJECTIVE COLORS 


JOZEF COHEN AND DONALD A. GORDON 
University of Illinois 
has perhaps the most 


Color vision, of all psychophysical phenomena, 
elegant scientific formulation. The color manifold can be represented as 


a closed three dimensional space with reference to three arbitrary pri- 
maries. It thus fulfills the space requirements of solid analytical geom- 
etry and all transformation formulas referring to vectors in space can be 


made to apply-' 
If the intensity function is eliminated by projecting the xy 271 
plane onto the X) plane, then the resulting configuration, called a 
chromaticity diagram; fulfills all requirements $0 that all theorems of 
projective geometry for closed spaces can be made to apply. We do not 
believe that there is any other logical or biological f unction which 
may be so adequately described by mathematics 50 concise. 

This remarkable fact has been noted by the mathematician, H. Weyl. 


With respect to the isomorphism between the projective plane and the 


sensation, he adds (83, P- 18), 

plane and the c e isomorphic with one 
which is correc is transferred un- 
A science can n 

sentation. The idea of isomorphism indicates 
the self-understood, insu ple barrier of knowledge. It follows that to- 
ward the ‘nature’ of its obj tence maintains complete indifference . . - 
For example, what distinguishes the colors from the points of the projective 
plane—one can only know in immediate intuition. 


another. Every theorem 
changed to the other. ++ 
except up to an isomorphic repre 


put in recent years some workers 
of the space. This is stated 
69, 70). This latter series of 
in the mathematical con- 
the space is Desarguesian, and 
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The experimental fact that all colors may be reproduced by three 
real primaries involving negative coefficients, or three imaginary pri- 
maries involving only positive coefficients, leads to the inevitable 
hypothesis that there are three receptors in the retina, each of which is 
associated with a so-called “fundamental sensation curve." The enigma, 
the great enigma, of color vision is that azy three points (not on a 
straight line) will serve as primaries, since lines connecting them will 
completely span the projective or chromaticity plane. Mathematically, 
any three points, corresponding to the primary colors, will produce à 
unique set of fundamental sensation curves, and are therefore as good 
as any other set of three points. If the object of science is prediction, 
then any set will predict color mixture data as well as any other set. 
However, if a choice of primaries can be made so that they predict some- 
thing more than just the phenomena of color mixture, then they are 
desirable from the point of view of parsimony. People have, of course: 
done just this. Helmholtz selected his primaries on the basis of the fact 
that there are no changes in hue from 670 millimicrons on out to the long 
wave end of the spectrum, Hering on the basis of after images, and Ladd- 
Franklin on a phylogenetic basis. For a fine discussion on the nature and 
transformation of these primaries, see Judd (48). 

There still remains an outside criterion for the selection of the pri- 
maries which has received little consideration from workers in color. 
We refer to the phenomenon of subjective colors, first discovered by 
Bénédict Prevost 122 years ago. We propose to present here a detailed 
history of the experimental work on these colors, followed by a brief dis- 
cussion of the logic by which these colors may be made to determine the 
primary receptors and the corresponding projection and fundamenta 
sensation curves. 

The history of the phenomenon divides itself into several natural 
divisions, and the analysis will be in terms of these divisions. 


I. Prevost’s DISCOVERY AND THE First PERIOD or OBSERVATION 


Bénédict Prevost (62), a French monk, discovered the subjective 
colors in 1826. He observed a ray of white light in a darkened chambe™’ 
He moved a white, rectangular cardboard perpendicularly to the ray: 
such that the ray was cut at successive intervals. He noted that the 
ray was white, at first, then a violet in the center, then a deep indigo 
then a greenish-yellow followed by red. ‘One sees," he writes, "that the 
light of the ray ... has been decomposed as if by a prism, into the 
seven principal colors ranging in almost the same order.” 

This is not to say that Prevost believed that he had actually de- 
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composed the light as Newton did. On the c 7 ; 
that the effect was physiological in nature, and oe. wick 
etie hs Presumably, there are substances which com- 
i nerves in the retina, giving rise to the sensations of color 
4 hese substances act at different rates. Since white light is com aged 
of different wave-lengths, intermittent white light should oni ie the 
subjective colors seen. Since the concept of three receptors was un- 
known to Prevost, his theory compares most favorably with those of 
later more sophisticated investigators. 
The first rediscovery of the subjective colors was made by G. T. 
Fechner (33) in 1838. It is Fechner who is usually credited with the 
original discovery. It was well known to Fechner that a disk whose 
composition is partially black and partially white will produce a gray 
when rotated, the function of the ratio 


luminosity of the gray being a 
of black to white. He noted, too, that the disk must rotate above the 
fusion frequency. 


In order to produce various gradations of gray, Fechner prepared a 
disk which was divided into eighteen concentric circles, the diameter of 
each circle differing from the adjacent circle by a unit length. The in- 
nermost circle was completely black, the outermost circle was com- 
The remaining circles varied in the relative amounts of 
as illustrated in Fig. 1a. The disk included in the Fech- 
ly seven concentric circles. This was done so that 
be clear, but Fechner never used such a disk. It 
buted to Fechner by many later investigators, for 
(24) 

f the black figure ( 


has some degree of merit. 


pletely white. 
black and white 
ner paper shows on 
the diagram might 
is erroneously attri 
example, Colenbrander 
The circumference 9 É 

forms an Archimedes’ spiral. While trying to SP! 
the fusion frequency, 


above the fusion frequency, Fechner spun it below 

and was amazed to sce the many and varied colors which were produced. 

notes, were not saturated, and yet were unmistakable. 
them in various degrees of distinctness. Some called 


some saw them only indistinctly, but all agreed that 


disregarding the corners) 
n the disk at a speed 


The colors, he 
Other persons saw 
the colors brilliant, 
some color was evident. . 

Now Fechner begin i t with the 
subjective color. Spee of rotation and direction of are fa 
If white partial circles are used on 2 black background, as in pe 1b, 
colors are again produced although not in the same ome " before. 

If a disk of the type shown in Fig- 1cis rotated e n h pain i a 
is produced as expected. At moderate rotation the dis dri ow; 
at successively increasing speeds, it passes through yellows h-green, 


possible determiners of 
rotation are factors. 
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Fic. 1. (CONTINUED): Disks FOR THE 
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t, Wolf; Fig. 1u Bidwell; Figs. 1 


OF THE SuBJECTIVE COLORS. 
Figs. 1p, 1q, 1r, Hurst; Figs. 1s, 
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green, and light blue. A “marbling” is interwoven through the colors 
appear on the disk. When the blue phase is reached, the whole disk is 
covered with a tapestry-like design of multi-colored cells. The nature 
of this design is suggested in Fig. 2. We have succeeded in reproducing 
this cellular design in the Illinois Laboratory in a clear and striking 
fashion. The drawing in Fig. 2 is not from Fechner, but from our own 
laboratory. The bases of the cellular formation are not clearly under- 
stood, and the existence of the formation has been noted, as will be 
seen, by many later investigators. 


Fic. 2, CELLULAR DESIGN OF A HEXAGONAL NATURE ORIGINALLY REPORTED BY 
FECHNER. 

The design accompanies the subjective colors. The drawing is from experiments per- 
formed at the Illinois Laboratory. 


When the speed of the rotating disk is increased ever-so-slightly t° 
the fusion frequency, the disk quickly fuses to gray. 

Fechner advances a theory concerning the mechanism of the sub- 
jective colors. Imagine the disk to be rotating. When the observe! 
fixates the disk, black replaces the white (which only an instant before 
replaced the black). The impression of the white does not disappeaf 
immediately, and, as a matter of fact, lasts for some time as is demon” 
strated by, as Fechner says, “the phenomenon of after sensations” 
The perception of white is dependent upon many receptors in the ey® 
probably three. But the rate of fall (Fechner does not mention the rise 
but should) of each of these receptors is different. And it is this differ" 
ential fall (and rise) which is the basis of the sensation of color. 
moment later the eye is stimulated by white light again, the white 1 
replaced by black, and the entire cycle repeats itself. 

Dove (26, 27) in 1848 noted the relation between Fechner's co 
and those produced as a simple after sensation. He describes dis 
similar to those of Fechner, with the difference that white is replace 
another color. When the disks are rotated he describes the resulting ge 
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ception. The paper is of theoretical value since it demonst 

lation between after sensations and subjective eget queue 
are really subjective—but makes no really significant ees 
Dove's disk is reproduced, after his description, in Fig. 1d. ned 
A x 1859 John Smith (73, 74) made the second. rediscovery. He be- 
ieved that the subjective colors were a direct result of the physical char 
acteristics of light and that the new theory of light which he is : ped 
would replace that of Newton. We reproduce verbatim a muria el 


Smith's abstract. 

The author produced the same results by cutting out spaces in a white card. 
and causing it to revolve on a black surface. He produced also similar phe- 
nomen by causing these figures to revolve when held perpendicularly, and to 
take the appearance of coloured solids. He also caused these colours to be re- 


flected on a white surface from the revolving disk. 
Remarkable as these experiments are, they are not more remarkable than 


the results they lead to. 
y of the ether. 


They prove the homogeneit 
They prove the undulatory hypothesis, but o 
t into the theory 


They show the necessity of introducing à 

of colour, or that colour is the effect of two coordinate sensations—a positive 

and negative. 

_ They enable us to dispense with the different 

light, taught by Newton. 
They remove the necessity 

position in matter to produce wave 


They help to explain the phenor 


light. 
They give a new explan 


and simple manner many very 


Smith indicates that there ar 
only four. Fig. 1e illustrates T: 
description. One may conclude that Smit 


the theoretical aspects © 

Rood (63) in 1860, realiz 
John Smith was invalid, since t 
He confirmed the existence of 
origin. 

Rood first observed the subje 
ure the time of small explosions © : 
For this purpose he used a black disk wit 
in Fig. 1f. A certai d the Jower par 
green, while at slo entire flame appeare 
red. At very slow spe ion, the appearance 
nated between white 


ppose the undulatory theory. 


refrangibilities of the rays of 


for the supposition of different waves OF of a dis- 


s of different lengths. 
nena of what is called the polarization of 


tic refraction, and explain in a plain 
natural phenomena. 

disk, but describes 
structed from his 
isinterpreted 


ation of prisma 
interesting 
e five rings on his 
Smith's disk, recon 
h had perhaps ™ 
f the subjective colors. 
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he color appearances were subjective. 
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hile attempting to meas- 
by use of a rotating disk. 
h four radial slits, reproduced 
t of the flame to appear 
d to be a purplish- 
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A more elaborate disk of the type illustrated in Fig. 1g was now 
constructed. lt was nine inches in diameter, while the slit was one 
half inch at the circumference. The disk was rotated against a white 
cloud. At a rotational speed of ten revolutions per second, the cloud 
appeared to be a deep red. ae 

The disk was viewed through a colored filter whose transmission 
properties were such that it transmitted the red, orange, yellow, some 
green, but excluded the blue and violet. If radiations coming from the 
disk were red and only red, then the filter should transmit these rays and 
the resulting sensation should be red. The sensation, under these con- 
ditions, is not red but a greenish-blue, indicating the subjective nature 
of the phenomenon. The subjective nature is further shown in that if 
a yellow glass is employed, the disk appears violet; if a green glass be 
used, the disk appears green, neutral, or faint red. 

Brewster (16) in 1861 made the third rediscovery of the subjective 
colors before becoming familiar with the work of Prevost and John 
Smith. He observed the colors while watching a rotating series of slits 
pulse a white light source. The colors, beginning at the center of his 
observational field, were as follows: white or bluish, darker blue, white: 

a well defined dark ring, white, greenish yellow, reddish. He suggests 
that similar phenomena may be seen in a flickering gas flame. 

Using an apparatus consisting of a rotating disk in front of an il- 
luminated ground glass, he was able to reproduce the cellular, hexagona 
pattern reported by Fechner. He reports, too, an additional quad- 
rangular “ - - pattern, which is too evanescent to permit it to be drawn: 
[which] consists of a series of dark quadrangular spaces separated by 
triple or multiple lines of light. It has no relation whatever to the hex 
agonal pattern, and has never been seen unless the eye has been stronglY 
pe ae a i Successive impulses of the luminous disk." He sug” 

B at the hexagonal forms may represent cells in the choroid coat: 


he is uncertain concernin sine 
£ the origin of tl y 
Rood (64) published S he.quadrangular forms 


: a short note in 1861 in which he reviewed the 
meager literature on subjective color then available. He suggested that 
the hexagonal pattern r 


s eported by Brewster might be related to simila 
observations reported by Purkinje and Vierordt (see 43, p. 257). 


II. HELMHOLTZ TO STEWART—THE SCIENTIFIC FORMULATION 


The genius of Helmholtz began the second well defined historica! 
period. Here we find the first formal applications of scientific principle* 
Helmholtz (43) in 1860 (translated in 1924) pointed out that if an ob- 
server fixates a Maxwell disk with black and white sectors, he notice? 
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t — 
rigs jae s - Ls on the advancing edge and bluish on the other 
Vac pent o_o the red becomes reddish yellow, and the 
ml the ieee doce n ghee eem the red becomes pinker, 
a ce y en. greenish. With slower rotation," he writes, "the 
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: à reads o Š 
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Wo. orem discesa ient ve qe 
dur n. m n somewhat more easily by use of the 
7 Practice is necessary to see the colors 
pparently, the eye must be fatigued by the flicker to arrive at the 


optimum conditions He re 

. ports the hexagonal cells. If a partiall 
termes and partially black disk is rotated, the observed P ud c i 
ade and be non-homogeneous. He believes that the “fluttering heart" 


illusion is related to the subjective colors. 
Helmholtz theory is that of Fechner— 
a differential rise and fall. The blue, says 


red. 

Brücke (19) utilized disks illustrated in Figs. 1); 
p to show that Talbot's Law, as a result of Fec 
“a subjective color, was not true under all conditions. Following a the- 
ein discussion of the relation between the objective stimulus and 

e resulting sensation, he spun his disks in order that he determine 
which speed produced the brightest appearance. The brightest color, 
obtained when violet just about goes Over to blue, was observed atan 


of 17.6 white pulses per second. 
f the subjective 


in their maximum saturation. 


that the three receptors have 
Helmholtz, lags behind the 


1k, and 11. He pro- 


hner's experiments 


colors was closely 


average rate 
He concluded that the perception o 
allied to what modern investigators call adaptation or fatigue, ^ the 
i dy stimulation of the retina. Ac- 
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the subjective co 
mple, after images, 
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perception brought a 
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as any other color experience, 
tire discussion is confused by t 
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Aubert (4) included a discussion of the subjective co 
book on physiological optics in . He indicates that he sees the 
colors on Brücke's disk at the greatest saturation after a fixation of 
d of the disk is à determiner of the 

he speed 


about one half minute. The spee 
he disk is decreased, then t 


colors seen. If the illumination oft 
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of rotation of the disk must be increased in order that the appearance 
remain constant. 

With reference to the Brücke disk (Fig. 1j), he says that if the fres 
quency of the white pulses is more than 56 per second, then the ring 
appears homogeneously gray. If the pulses are between 42 and 48 per 
second, and an outside part of the ring is fixated, then the part in direct 
vision will be gray while the part in indirect vision will appear to be a 
Berlin blue. As the speed decreases the extension of gray in direct 
vision decreases. The ring does not appear entirely uniform, being in- 
terposed with dark and light radial stripes, and blue and gold become 
apparent. As the speed is decreased the blue and gold become more in- 
tense and the square (hexagonal?) cell structures are seen. As the 
speed of the disk continues to decrease the extent of the blue field de- 
creases until it disappears at 10 pulses per second. Various other color 
phenomena are described. 

Exner (32) in 1870 also observed that the subjective colors appear 
to differ when viewed in the center or the periphery of the eye. His 
introspective descriptions are given in great detail. The colors, he 
points out, are seen in greater saturations when the eye is fatigued. 
But the fatigue of one eye does not facilitate the perception of subjective 
colors in the other eye, and it would follow, therefore, that this aspect 
of the subjective colors is not a central effect. 

Exner reported the presence of quadratic or six sided pointed ele- 
ments, and expressed his belief that an explanation of them would be 
of fundamental importance to physiological optics. 

F. J. Smith (72) made the fourth rediscovery in 1881. He theorized 
that light might be decomposed by interruptions proportional to the 
wave-length of a particular ray of the many, which together are the 
composite ray. He used a large bicycle wheel between the sun as a light 
source and the observer, and produced the subjective colors. Performing 
simple experiments he believed that he had derived the wavelengths of 
the spectral colors. 

The fifth rediscovery was made by Hannay (42) in 1881. While working 
on dry photographic plates in a room illuminated by a ruby light, he 
noticed that movements of his fingers produced a greenish blue, with 
blue predominating. He independently derived the Fechnerian theory 
of the rise and fall of the receptors, and suggested that this phenomeno? 
might be used as a basis for determining the fundamental color sen* 
sations. 

N. Smith (75) stated in 1882 that Hannay’s explanation (the Fech 
nerian theory) was suitable for colors seen in the photographic dark- 
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room, but was unsatisfactory for colors seen on a rotating disk. He 
offered no alternative explanation. 

Stewart (76) made the sixth rediscovery in 1887. Investigating Tal- 
bot’s Law, he noticed that observation of a rotating mirror gave rise 
to the subjective colors. He described the results of careful observations 
under varying lights of different intensities. 

Stewart independently derived the Fechnerian theory of the differ- 
ential rise and fall of the color receptors. Further, he determined the 
mathematical nature of the different response curves from his experi- 
mental results, and derived the conditions under which they must 
operate. Because of their great importance, his proofs are summarized 
here. Stewart indicates that there are three possibilities concerning the 
nature of the curves: 


Case 1 is illustrated in Fig. 3a, where the rise of each of the three receptors is 
linear, all three begin at the same instant, but the rate of rise for each of the 
three is different. The curves are shown on three separate horizontal axes, but 
actually they are superimposed one on the other. The color seen at any given 
instant, say at time X, is the ratio I,:Ig:I3. When I, I; I5 each reach their 
maximum, then white is seen. For this case, although different colors can be 
seen depending on the ratio I,:I2:I3, the color will not change as the receptors 


reach their peak of excitation. Experimentally, the subjective colors do change 


in time, and case 1 is therefore impossible. 

Case 2 is illustrated in Fig. 3b, where the rise of each of the three receptors 
is linear, but all three do not begin at the same instant, and again the rate O 
rise for each of the three receptors is different. It is obvious that subjective 
colors may arise under these conditions, since one receptor may be respondin£ 
while the other receptors are not responding. It would be a more powerful proo 
if the assumption of linearity could be eliminated, bringing us to the next case 

Case 3, where the rise of each of the three receptors is non-linear and, it is 
assumed that all three begin at the same instant, is illustrated in Fig. 3c. Tho 
color (which in this instance is white) seen at time X, will be given by the ratio 
A1 Bi AsBo: AsBs. The color (which is not white) will be different when the 
ratio is abı :azbz:asbs at time X», since these ratios differ from the former ratios: 
This explains the mechanism of the subjective colors. 

Fig. 3d indicates the responses of three receptors under pulsing white light 
pulses which are of such short duration so that the stimulus is almost never 
perceived as white. The assumption is made that the pulsing light sets up 
different phase relationships among the receptors, since the natural periods B 
each of the receptors is probably different. Observations indicate that as the 
intensity of the pulsing white light is increased, the subjective colors tend t0 
ward violet. The explanation is found with reference to Fig. 3d, where it 18 
assumed that the change in rate of rise is most appreciable in those curve? 
which, at a given speed, are less steep. 


Stewart reports many experiments in which colors (some mono” 
chromatic) other than white were pulsed. He reports many subjecti V^ 
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ee which indicates that more than one receptor is being stimulated. 
. harpentier (21) rediscovered the subjective colors for the seventh 
time in 1891. He observed the phenomenon by looking at light which 
was intermittently pulsed through holes. In a second paper (22) he 
discussed the theoretical implications of his “discovery.” He reports 
that intensity of light is not a factor. He states that the subjective 
colors tend to refute the Helmholtz trichromatic theory of color vision, 
because, all colors are reproduced, not just three. The alternative 
theory which he proposes is not reasonable. In a third paper Char- 
pentier (23) describes an additional technique for producing the colors. 
The paper is not of great importance. 
R Sanford (65) was apparently the first individual to include the sub- 
jective colors in classroom teaching. He discussed an experiment on 
the colors in his course in physiological psychology in 1893. 


N 1894-95. THE REVERSION T 
re began in England a period of con- 
ed the artificial spectrum top. The 
] Nature, and were keynoted 
top (84), illustrated in Fig. 
nts colors in an extremely 
position of the 
d. The Ben- 


III. REDISCOVERIES I O OBSERVATION 


In the years 1894 and 1895 the 
siderable interest in what was call 
discussions appeared in the British journa 
by a description of the so-called Benham 
1m. This top, sold extensively as a toy, prese 
striking fashion—and is even more interesting since the 
colors is reversed when t otation is reverse’ 


he direction of r 
ham disk constituted the eighth rediscovery. 


ed an address to the Philosophical Society of 
hich he presented a variety of disks similar to 
ted a theory very similar to that of Fechner. 
discussion by saying that Liveing's theory 
t, since the colors could be seen under the 
de clear as to why the 


was wrong and his 
t for the colors under these condit- 


illumination of a so 
Fechnerian theory would no 
ions. 

Liveing (53) replied that he had observed the top under the sodium 
light, and had observed colors, although the colors were not the same as 
those observed under white light. He too fails to state that the Fech- 
nerian theory will account for the colors, but merely repeats that the 
colors are of a subjective nature. 


Abney (2) rotated the Benham 
olors were O 


matic lights. The following colo! 
No. 3, and No. 4 are the triple lines of the top 
from the center out: 


t accoun 


top in white and near-monochro- 
pserved, where No. 1, No. 2, 
(see Fig. 1m) in order 
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With white light: No. 1 crimson, No. 2 olive green, No. 3 gray (slightly 
violet), No. 4 dark violet; with red (C light): No. 1 red, No. 2 lighter red, No. 3 
very light olive green, No. 4 darker olive green; with green (Magnesium b): No. 1 
bluish green, No. 2 lighter bluish green, No. 3 lighter bluish green, No. 4 ruddy 
black; with blue (near the blue Lithium): No. 1 grass green, No. 2 lighter grass 
green, No. 3 lighter grass green, No. 4 ruddy black; with violet (all the violet of 
the spectrum): No. 1 light violet, No. 2 light violet, No. 3 light violet, No. 4 
darker violet with a suspicion of red. 


Abney states, "When a red a little below the red lithium line was 
employed, all the groups appeared dark red, and, as in the three sen- 
sation theory this part and the violet are simple sensations, the results 
obtained in these last were to be expected.” 


Continuing with mixtures of two simple colors, the following results 
were obtained: 


With a mixture of red and green to make white: No. 1 indigo blue, No. 2 
reddish orange, No. 3 reddish orange, No. 4 darker orange; with a mixture of 


yellow and blue to make white: No. 1 sky blue, No. 2 sage green, No. 3 sage green, 
No. 4 bluish black (perhaps black). E 


These results are extremely important in the theoretical application of 
these colors to the isolation of the primary receptors. 

Finnegan and Moore (34) constructed two disks illustrated in Figs. 
1n and 1o. They found that the width of the lines was highly important. 
If the lines are too broad, then color appears only on the edge. Further, 
the sequence of black and white with respect to the lines and the rate of 
rotation are determiners of the colors seen. 

Edridge-Green (29) pointed out that the subjective colors of the Ben- 
ham top were previously discussed by Helmholtz and Fechner. He 
reports the hexagonal cell structure when the Benham top is rotated. 

Benham (10) mentioned in a note that his theory (which is im- 
probable) would adequately account for the change in colors under 
change in rate of rotation mentioned by Finnegan and Moore. The 
statement is not clear. 

Turner (80) suggested an apparatus whereby a glass spectrum top 
is inserted in the slide holder of a projector, rotated and projected, so 
that demonstrations might be made to large audiences. He also pro- 
posed the use of colored filters for additional effects. 

Newton and Co. (55) called the attention of the scientific public to 
the fact that the Benham disk was copyrighted and that the use of 
the disk for projection was prohibited. 

Hurst (45) then suggested three new disks which were not copy- 
righted. They appear in Figs. 1p, 1q, and tr. 

A note appearing in the British journal, Engineering (86), made 
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mention of the Benham top. A theory of one MacFarlane Gray i 

discussed. According to Mr. Gray, the subjective colors are cia ee 
the color aberration of the lens of the eye. He proposes an Pawan 
which compensates the lens, under normal conditions, for every change 
in seen color. When compensation cannot be made quickly, subjective 


colors are seen. 
These notes were followed by still another in the Scientific American 
(85). The Benham top is again presented. MacFarlane Gray’s theory 


is discussed with the statement, "It does not, perhaps, present the 
degree of clearness and precision to which we are accustomed in the 
study of optics. The field remains open to investigation for varying the 
experiments and completing the first exposed.” Two new disks, de- 
signed by Charles E. Wolf, were presented and are illustrated in Figs. 
1s and 1t. The disk in Fig. 1s is the obvious extension of the Benham 
disk, since the lines will produce finer gradations of color. Fig. 1t is 
the same as Fig. 1s, with the difference that the lines have been con- 
nected so as to produce continuous variations in color. 

James McKeen Cattell (20) concluded the 1895 series of notes by 
pointing out that the spectrum tops were introduced to modern science 


by Fechner, Briicke, and Helmholtz, and hence were not patentable. 
Henry (44) in 1896 attempted to explain th ors of the Benham 


e col 
top by the differential sensitivity of the periphery and fovea of the 
retina. He even went so far as to determine the different sensitivities 
of the foveas of several individuals. This is not the mechanism of the 
colors, as will soon be seen from the experiments of Doniselli and 
Piéron. 

In 1896, Schwartze (66), in an appendix to his book on molecular 
physics, attempted to show that “the Newtonian hypothesis -is 
based on the absurd assumption of the multicoloration of the uni- 
colored light," by use 9 ' disk. This he did unsuccessfully. 

Bidwell (11) in 1896 mentions several new methods of producing sub- 

:milar to those of Charpentier, to 
Bidwell believes that the colors of the 
his theory that color fibres ad- 
e retina continue to respond 


after a stimulus has ceased, by 4 process of "sympathetic reaction." 
r Bidwell (12) in 1897 stated that the greenish blue 


In a second pape : 
subjective nota be + his previous experiments was due to the 
after image of the subjective red color. This 1s rather remarkable for, 
as Bidwell himself states, the greenish blue may be much more intense 
than the red. He made the additional observation that the illumination 


Benham top may be 
jacent to those of the sti 
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is a determiner of the color seen. He found, further, that an observer 
may not be able to distinguish between a real and a subjective red. In 
the following year, Bidwell (13) reviewed his experiments and published 
a picture of his disk which appears in Fig. 1u. 

A note then appeared in the French journal La Nature (41). The 
colors were demonstrated by means of a half black, half white disk, 
with a triangular cut out (similar to Bidwell, see Fig. 1u). When rotated 
in front of a red field, one sees blue-green instead of red. The theory is 
advanced that the colors are due to retinal fatigue. 


IV. BAGLEY AND DONISELLI. THE PSYCHOPHYSICAL APPROACH 


In 1902, Bagley (5) began a new period in the history of subjective 
studies with the first, and one of the best, systematic investigations of 
the phenomenon. Included with this study is the first extensive histor- 
ical survey. f 

Miss Bagley utilized ninety-seven modifications of the Benham disk, 
three of which are illustrated in Figs. 1v, 1w, and 1x. Systematically, 
Bagley studied the effects of the length of lines on the color seen, the 
effect of the width of the line on the color seen, the effect of the dis- 
tribution of the groups of lines on the disk, the effect of background color, 
the effect of the amount and distribution of background color. 

Spinning an ordinary Benham disk, she reports that black followed 
immediately by white shows the lines to be red. When black is both 
followed and preceded by white, the lines appear to be green. Varia- 
tions of the amounts of white vary the color of the green. Black pre- 
ceded by white shows the lines to be blue. When lines which appear 
to be red green, and blue, under normal rotation are lengthened, the 
colors appear to be less intense but not less saturated. As a general 
rule, the width of the lines has no effect on the hue seen, but it may effect 
the other color variables. 

When the black sector is varied, experiments show that the colors 
seen are not dependent merely upon position, but upon actual duration 
of the stimulation. With an increase of black, the reds are emphasized. 
When white is increased, the blues are emphasized. In bright daylight, 
the blue is emphasized. In less intense artificial light, the reds are 
emphasized. 

Reversal of rotation of the Benham disk does not give an exact re- 
versal of the colors seen. The hues are reversed, but the colors appear 
in different saturations and intensities. 

When a gray screen is placed about a particular ring of one of the 
disks, the effect is to decrease the color seen of the ring. When the white 
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of the Benham disk is replaced by a physical color, very interesting re- 
sults are obtained. Lines which would appear red if on a white back- 
ground, appear green if on a red background; lines which would appear 
blue if on a white background, appear to be reddish brown on a dull 
orange background. These experiments illustrate the contrast effects 
possible with subjective colors. 

Bagley notes some of the general conditions of observation. Short 
periods of keen attention aid in the perception of colors. Visual fatigue 
tends to deaden the colors. Practice, contrary to Helmholtz, is not 
necessary to see the colors. There is an optimum speed of rotation at 
which the colors are seen most clearly. 

Miss Bagley relates her results to the Ebbinghaus theory of vision. 
Her theoretical discussion is not clear and will not be reviewed here. 

Doniselli (25) in 1907 published the second of the important experi- 
mental papers. He constructed a Benham disk having a 225? white 
sector, a 135? black sector, and a variable white section which allowed 
the black to be reduced to 4 of the white section. The disk and its 
analogue, a design to appear on a kymograph drum, are illustrated in 
Figs. 4a and 4b. Using this variable disk he systematically investigated 
the effect of the black/white ratio, the speed of rotation (2-17 revolu- 
tions per second), and illumination. 

Doniselli presents his results in a systematic form. They are shown 
redrawn in Fig. 4c. Apparently, the color seen is, in most cases, a linear 
function of the speed of rotation. At greatest speeds, the colors in the 
rings correspond to the natural order of the spectrum. At lower speeds 
there are inversions. 


In his theorizing concerning the cause of the phenomenon, Doniselli 
writes as follow: 


a. Every sudden stimulation by intense white light of brief duration, gives 
rise to a primary impression which is predominantly red-yellow. 

b. The primary red-yellow phase is followed by a secondary blue phase- 
The speed of passage from the primary to the secondary phase is directly pro- 
portional to the intensity of the original white stimulus. 


c. When the intensity is high enough, the primary phase is too rapid to be 
seen, and the colors seen appear to be reversed relative to those seen at medium 
intensities. 


Doniselli presents a mechanism for the subjective colors which is 2 
direct analogue to a vibrating string varying in tension, which produces 
tones. Doniselli has done what so many other investigators have done— 
he has confused the subjective colors with colored after images. Finally 
Doniselli eliminated the hypothesis of Henry that the subjective colors 
were caused by the differential distribution of the cones in the retina: 


a 
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by demonstrating the colors on a sheet of paper rolled about a cylinder 
(Fig. 4b). 

Sexton (68) discussed the Benham top in 1907. He indicated that 
the color was affected by the position of the sector lines, the length and 
width of these lines, the direction of rotation and the illumination. He 
independently derived the Fechnerian theory of the differential rise and 
fall of the receptors. He observed that the fact that colors are seen in 
sodium light in no way disproves the Fechnerian theory—since the 
sodium light affects the three receptors. 

Percival (57) in 1909 became interested in the problem, but simply 
repeated Stewart's experiments and presented Stewart's diagrams and 
theories. 

Baumann (6) published the first of three papers on the subject in 
1912. He described a simple apparatus for rotating the disks, and de- 
scribed the colors of the Benham disk which he erroneously attributes 
to Schwartze. He presents three new disks. In a second paper (7), he 
states that the smaller the image of the disk upon the retina, the more 
vivid are the subjective colors. In his last paper (8), published in 1918, 
he presented five new disks which are minor variations on Wolf's tops 
published in the Scientific American (sce Figs. 1s and 1t), to which he 
makes no reference. Factors which influence the colors seen are the 


nature of the eye, the size of the sectors, the speed of rotation, and the 
illumination. In direct sunlight he sees no colors at all. He proposes 
a theory based on the co although conducting the 


ncept that nerves, 
colors seen, cannot conduct more than a d. The theory would 


certain loa 
appear to be improbable 0. 
ted that the subjective colors 


Edridge-Green (30) in 1916 repor 
ating on a kymo- 


could easily be seen on a black and white design rotating 
he says, are the result of after images. It is 


graph drum. The colors, B i i 
difficult to understand how, as he says the after image of black is 
green.” 

WHITE STIMULI 


V. INTEREST IN NoN- 
8 Ives wrote two papers which, together, constitute 
Actually, they constitute 


f the subjective colors. 
tions to the field of color, 


one of the really great experimental contribu 
but Ives was not only unaware of their importance but was completely 


amiss concerning the theoretical implications of his findings. This is all 
the more interesting, since Ives was 4 man of monumental proportions 


in the field of physiological optics. —— 
In the first paper (46), the assumpti 
ulus is transmitted through a layer of m 


In 1917 and 191 
the ninth rediscovery © 


on is made that the visual stim- 
atter having a coefficient of 
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diffusivity which is different for different wave-lengths, and varies too 
with the intensity of the stimulus. These theoretical assumptions are 
discussed in some detail. 


In order to investigate further this visual diffusivity of the eye, Ives con- 
structed an apparatus consisting of two concentric circular disks of sheet metal, 
one outside the other, with a narrow radial slot on each. The relative position 
of these radial slots could be varied. Colored glasses could be placed in the 
slots. The speed of rotation of the disks could be varied. A source of illumina- 
tion was placed behind the disk. 

When one slot was of a dark neutral tint while the other was of a lighter 
neutral tint, and the disk rotated, then the dark slot appeared several degrees 
behind the other. At very high or very low intensities, this phenomenon was 
less marked. 

When one slot was covered by a blue filter, and the other by a red filter, 
the blue image lags behind the red image by several degrees. If red, green, and 
blue filters are used, the red is first, then green, and then blue. Ives writes, 
“This lagging of one colour behind the other is a very striking phenomenon, well 
suited to demonstration to a small audience. The effect is most extraordinary if 
the disk, instead of being rotated, is rapidly oscillated through five or ten de- 
grees, then the blue image may be made to vibrate entirely out of phase with 
the red, appearing as though attached to it by a cord.” 

If a purple (mixed) color is revolved, the two components do not resolve 
(give one slot of red and one slot of blue). Ives explains this by saying, “the 
difference in diffusivities peculiar to two colors is much reduced when they are 
simultaneously transmitted to the same retinal area, although it is still present 
in some degree.” 


Nevertheless, the following year Ives (47) did succeed in resolving a 
color. He built a new apparatus, which utilized a projection lens system 
and a rotating mirror. He proposed to determine experimentally 
whether a mixed yellow resolved into its constituent red and green, 
and, whether a pure yellow will be resolved into anything at all except 
pure yellow. Ives found that the compound yellow was clearly resolved. 
into red and green by a lateral motion across the field of view. Pure 
yellow did not separate. 

Actually, Ives never did integrate his results with purple and yellow- 
We may point out that they were obtained with different apparatuses 
and that may have something to do with the experimental results. But 
what is important, and what Ives failed to realize, is that he did get 
a stimulus to resolve. The resolution is probably not caused by the 
differential diffusivity as Ives supposes, but by the differential rise an 
fall of the receptors (see 14). Further work using his technique shoul 
yield most valuable results. 

In 1922, DuBois (28), experimenting with mollusks, noticed that 
visual receptors respond at different rates to different wave-lengths- 
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Fig. 5b is Kormann's design for 
The pattern is stationary and is 
n for use on a kymograph drum. 
tionary. The colors are seen as 
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He proposed a theory closely analogous to the modulation theory of 
Troland (78). He observed, in passing, that the color experiences ofa 
human observing the Benham disk may have a similar basis. 

The first three of four important papers by Piéron (58, 59, 60) ap- 
peared in 1922 and 1923. Piéron states the problem of the subjective 
colors as being psychophysiological in nature. Physical theories such 
as those of Benham are impossible since subjective colors are seen in 
monochromatic light. The colors cannot be the same as after images 
because of the difference in time of the two phenomena. They are, he 
believes, related to such occurrences as the “fluttering heart," as Helm- 
holtz had suggested. Again, he proposes the Fechnerian theory as a 
basis for the colors, although he does not give credit to Fechner. The 
curves, he states, of the rise and fall of the receptors ought to be de- 
ducible from the colors observed during the course of time. Further 
characteristics of the curves should be deducible from the fact that the 
curves rise faster when the intensity of the stimulating source is higher. 

Piéron devised a Benham-type disk, after those of Maxwell, in 
which he could vary the speed of rotation, the black sector, the white 
sector, and the length of the lines. It is really an elaborate version of 


the apparatus of Doniselli (sce Fig. 4a), and one form is illustrated in 
Fig. 5a. 


The first series of experiments indicated that the diverse colors correspond 
to the defined delays (angles between black and beginnings of arcs) of the gen- 
erating arc in relation to the beginning of the white sector, when the angle of 
the arc is kept constant. 

The second series of experiments showed that the colors change under 
changes in illumination. But these changes can be compensated for so that the 
color will appear constant (but with perhaps a different saturation), if either or 
both the speed of rotation and/or the angular white section are changed. He 
shows that the characteristic delay of each color varies inversely as the fourth 
root of the intensity of the stimulus light. Experimentally, if the intensity is 
too high or too low, no color is seen. ! 

The third series of experiments on the Benham disk indicated that thin lines 
show optimum subjective colors. As Finnegan and Moore previously noted, as 
the lines become thicker, color is seen only on the edge. If the lines become very 
thick, no color is seen at all. 

Under normal illumination, the optimum length of the arc is about 45? (the 
lengths of the arcs of the Benham top). If the arc is 180?, no color is seen at all. 
Less color is seen at angular lengths between 180? and 45°, and less than 45°. 

The fourth series of experiments showed that the amount of white sector 
following the lines has little influence on the colors seen, When the white 
precedes the lines, the amount of white determines the colors, with the exception 
of blue, which is always found in the terminal position. Several disks are pro- 
duced in Piéron's paper which, although widely different, illustrate such equiva- 
lence of color. The shortening of the black sector when the speed of rotation is 
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sufficiently rapid, weakens the blueing effect of the "privileged" terminal posi 
tion of the lines. Under the same conditions the leading arc goes from il i 
green because the delay is longer. The elongation of the black sector has the 


opposite effect on both the leading and terminal lines. 
The fifth series of experiments involved interpolating a segment of the blacl 
sector in the white sector, and on a different disk, the reverse procedure. In. 


sertion of a small black sector in the middle of the larger white sector tends tc 
blue the lines which precede, and redden those which come after. The inter. 
position of a white phase on the black sector has an effect analogous to th 
simple shortening of the black sector, and the result is as described above. 

In the sixth series of experiments Piéron compared the color appearances o! 
the Benham disk under radiations through Wratten filters (running the gamut 
of the spectrum), with color appearances under white light, in much the same 
manner as Stewart and Abney. He finds that colors are seen which are not in. 
cluded in the “monochromatic” radiations, as did the previous investigators 
but he is unable to derive any law of formation. 

Finally, if a negative of the Benham top (similar to that of Finnegan anc 
Moore, see Fig- 10) is rotated, colors are never seen on the white arcs, althoug! 


colors are seen on other parts of the disk. 

Piéron attempted to deduce th 
d fall of the receptors. The curves, which change i1 
the illumination, are essentially like Stewart' 


third case (see Fig. 3c), with the difference that Piéron proposes a $0 
’ i.e., where each receptor responds to. 


called “hypermaximal stage,’ 
high point before reaching an equilibrium. The receptors invariabl 
rise in the order red, green, blue, 


The black rings of the Benham disk appear t 
' from the neighboring white surfaces which are being pulsec 


on the lines is determined by the amount of activity i 
he resulting amount of irradiation. Thus, if there is n 
i.e., the lines follow the black directly, red is see 
strength of the diffusion. If there is some activity in tl 
lines follow white which follows the black, then som 
because there is less diffusion. 


tic light the color pulsed tends to stimulate ar 
gonistic receptors to di 


From his experimental results, 


curves of the rise an 
form depending upon 


fusion' 
The color seen 
the lines and t 
activity in the lines, 
because of the 
lines, i.e., the 
other color is seen 


In monochroma 
certain receptors, which allow the anta 


charge, and the complementary color is seen. Thus it follows that stir 
ulation at first tends to influence the receptors most highly tuned 
the stimulus, and thereafter the complementary receptors. Piéror 
experimental results are quoted to prove this hypothesis. 
Piéron concludes by formulating two laws which will predict | 


colors of the Benham disk: 


Law I. Under the influence of alu 
plex radiation which integrates to ma 


fatigue 


itation of the retina by aco 


minous exc 
here is first produced a ch 


ke white t 
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matic disequilibrium with successive predominant shades running from red to 
blue in order of the spectral colors, because of an unequa! speed of establish- 
ment of the fundamental chromatic processes released by this excitation until 
the hypermaximal transitory state is attained. 

Law II. When a small retinal surface is not, or is only feebly excited, in the 
neighborhood of a region which is the site of a luminous and chromatic excita- 
tion, the excitation of this surface by diffusion shows a predominance of chro- 
matic processes which are propagated with an intensity much greater than the 
luminous processes. 


Forester (35) pretended to use the Benham disk in proof of his 
highly improbable theory of color vision. He proposes two levels of 
excitation in the nervous system relating to color perception. The first 
is in the retina, called the R level, and the second is in the optic tract, 
and is called the T level. White light has the interesting property of 
being able to by-pass the R level and influence only the T level. It nul- 
lifies the centers at the T level, and prohibits them from being influenced 
by the receptors at the R level. White is the physical stimulus on the 
rotating Benham disk (with the exception of the black sector) except 
in one place—where the lines are on the white surface. The red re- 
ceptor can and does stimulate this spot, since the receptors are not 
nullified at the T level but why it should be red and not some other 
color seen is not made clear. The theory makes extensive use of the 
concept of Lapicque's chronaxie in its exposition. 

Pauli and Wenzl (56) in 1924 also attempted an experimental an- 
alysis of the colors produced on the Benham disk. The experimental 
variables were the composition of the disk (distribution of black, white, 
and colored sectors), number of arcs, size of the segments and openings, 
diameter of the disk, color of the arcs, color of the dark sector, and speed 
and direction of rotation. These factors were varied by means of a 
Maxwell-type disk. 

Pauli and Wenzl report that the colored appearances of the Benham 
disk can be divided into three groups based on the speed of rotation: 
the first at 3—5 rotations per second, the second at 5-37 rotations per 
second, and the third at 33-40 rotations per second. The colored ap- 
pearances are given in great detail and will not be discussed here— 
except to mention that in the second phase a mosaic pattern is seen, 
which may be the hexagonal pattern seen by others. 

If the white field is replaced by a colored field, after the manner of 
Bagley, the black sector shows complementary colors. If the black field 
is replaced by a colored sector, the colors are still seen, but they may be 
altered somewhat, depending on the particular color field substituted. 

Subjective colors may be seen during tachistoscopic exposure. 
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Under these conditions the colors show a decrease in intensity and dis- 
tinctness, and can be seen only at a fast speed of rotation. In direct 


sunlight many of the colors disappear completely. When the disk is 


viewed peripherally, no colors may be seen. In general, Pauli and 


Wenzl's other results agree with former investigators. 

Pauli and Wenzl, in their theoretical explanation, state that after 
images are not responsible for the colors. They theorize that each of 
the receptors has a typical inertia, and consequently a typical natural 
period. When the frequency of the pulse is that of the natural period 
of the receptors, it will respond. Pauli and Wenzel, after the manner of 
Piéron, also postulate an induction phenomenon in explaining the 
spread of the colors. “We have," they say, "an overlaying of three 
factors; primary excitation, resonance and number of excitations, and 
the induction effect.” 

Von Kries (81), writing in the 1924 American translation of Helm- 
holtz, pointed out that Helmholtz’ earlier description of the subjective 
colors could be applied to the Benham top. E 

Stewart (77) in 1924 wrote his second paper on the subjective colors. 
Observations were made by looking at a rotating mirror which reflected 
the light source. The colors appeared as luminous bands, and, in the 
main, the color appeared homogeneous over the entire surface except 
at the edges. However, the color phenomena on the edges of the band 
were essentially the same as the rest of the band, but different in phase, 
because the edges were illuminated by slightly less intensity. 

In general, a white light of given intensity may be pulsed at pro- 
gressively increased speeds so that the subjective color sensation Lye 
progressively from violet to red approximately in the eee of the 
spectrum. The colors after the green on the red end o fe e nem 
are less pure than the others because the pulse rate is tapı y powers 
ing the fusion point. However, under suitable conditions, when ? P 
lumination is carefully controlled, a very good red can be obtained. t 
can further be shown that when the pulse rate is held constant, Sie 
same succession of colors can be obtained by adjusting the ome o 
the light; when the physical intensity 1$ increased the colors shift to- 
ward the violet and when the physical intensity is decreased, the colors 
shif » uai cen bas be observed by looking ata black and an picture 
through a black, white, and sectored disk (similar to Bidwe J Fig. 1u). 

y the physical intensity of the 


ined b 
ffects are most determine i 
ea of any given portion of the picture, in other words, the 


shading. Stewart writes: 
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It is easy to cause the darker foliage in an engraving of a landscape to = 
sume a green color while the less shaded water or sky is violet or bluish, with 
the details of the shading represented by corresponding nuances of the sub- 
jective color. Then by a change of speed of the disk a complete change can be 
produced in the colors, which again, however, are appropriately placed on the 
different parts of the picture according to the depth of the shading. The colors 
are so easy to obtain that I have seen children amuse themselves by the hour 
"coloring" pictures in books or newspapers subjectively by observing them 
through a rotating disk, and describing the colors correctly. While it is gener- 
ally not difficult to clothe the landscapes in colors which are appropriate, or at 
least, not glaringly incongruous, it is of course different with pictures in which 
human faces or figures are the important element. With the shading below ihe 
chin perhaps a light green (at a particular speed of rotation), the cheeks violet or 
bluish, the hollows of the eyes perhaps green, the hair a dark green, and various 
garments colored according to their shading (creases, for instance, at a given 
phase green), and then everything changing phase with an increase or dimuni- 
tion in the speed, the personal appearance of a state beauty, the statesman or 
criminal of the hour, the horse which has just won the Derby, or the reigning 
pugilist, is apt to be somewhat bizarre, and therefore rather amusing for chil- 
dren. On theother hand black and white pictures of such objects as embroidery, 
tapestry, Scotch tartans, carved work, stained and cut glass, etc., may show 
curious and even beautiful appearances, the subjective colors imparting a cer- 
tain richness to the designs. Of course, unless the black and white pictures were 

expressly produced in order that the subjective tints might approximate the 
actual colors at some rate of intermission (a feat perhaps in simple cases theo- 
retically possible), the subjective colors could only by accident be correct. 


Stewart includes a rather elaborate discussion of his observations 
of a rotating Benham top. In general his results agree with those of 
previous investigators. When the lines are observed through a magni- 
fying glass, the color is the same as when observed with the naked eye. 
When the Benham top is held stationary and observed through a rotat- 
ing sectored disk, the color appearances on the Benham top are about 
the same as those seen when the top itself is rotating. Of course, all the 
arcs appear to be the same color, but the color can be made to change 
depending on the speed of rotation of the sectored disk. In experiments 
utilizing Benham disks with arcs of varying thickness, colors can be 
seen throughout the arc no matter what the thickness, although differ- 
ent colors are seen in different portions—a function of the speed of 
rotation. Color blind individuals do not see the subjective colors with 
the same clarity as normal individuals. 

Stewart conducted experiments wherein colored lights were ob- 
served in the rotating mirror and through the rotating disk. He also 
viewed colored pictures under the same conditions. He reports that 
with a slow pulse rate, the blue tints are strengthened, white becomes 
violet and the greens and reds are weakened. As the pulse rate is in- 
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creased, the blue remains predominant and the greens are strengthened 
At a high rotational speed, the reds prevail. The color changes in the 
picture are primarily dependent on the physical saturation, hue, and 
brightness of the picture; the Purkinje phenomenon is not a factor. 
Concerning the effect of the subjective color sensations on evolution, 


he writes: 

The question suggests itself whether the color sensations of animals in an 
environment habitually or frequently illuminated by flickering light, as in 
forests or not far below water surfaces, could be modified at all by such phe- 
those under discussion. If so, protective color adaptation might in 
these circumstances to take account of the fac- 
tor of subjective coloration. Protective coloration has considerable interest in 
connection with the question of color vision in animals. For it seems in some 
cases to afford a means of testing whether the response in sensation to given 
wave-lengths is approximately the same as in man. Whether the subjective 
modifications of color sensation caused by intermission at appropriate rates 
enter at all as a factor into such phenomena as the aurora borealis, or some other 
of the solar color phenomena, depends upon the magnitude of any intermittent 
inosity concerned in them and their period. Where the spectral 
fliciently explain the perceived colors there is of course 


ective factors. 


nomena as 
some degree be expected under 


changes of lum 
characters of the light su 


no room left for the subj 
subjective colors in 


Piéron (61) published his fourth paper on the 
1928. Essentially, it was à modification of the work of Abney. He ob- 
served in a most systematic fashion a rotating Benham top under 
various white lights. These white lights were always composed of two 
near monochromatic radiations. The whites looked alike; their spectral 
compositions were vastly different. Finally, the top was observed under 
a yellow source, composed of red and green radiations. —— 
Piéron’s results conclusively show that the color seen is a function 
of the spectral composition of the illuminating source and not of its 
tristimulus values. The results give rise to two principles. (1) The 
n of the illuminating 
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inant radiations of the spectral composi f 
cee d to becom (2) The complement of the domi- 


tends d principle may be in op- 
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e accentuated. 
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te a color wh 
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position to the 
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Kormann (50) i 
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performed by Doniselli anc 
of which is illustrated in Fig. 5b. He pon 
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Wolf's disk (see Fig. 1t) which. 
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lar to a half moon—like 
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tic spectrum and reported 


— 


AND DONALD A. GORDON 


JOZEF COHEN 


Siutuoz s,paeyqay syoidop J9 ` 


Wi pE aie 


\ 


“SPAY 
jp uonoenqur jo səyə P Buryensnylt sysyp s Arg 3 39-v9 “SHA 


19-29 


'"8u0'I07) azAluD3Íang AHL JO SOILSIMALOVAVHD AZIYA aH '97914 


zn. 


PREVOST-FECHNER-BENHAM SUBJECTIVE COLORS 127 


given intensity of stimulation increasing the duration causes the color to change 
progressively in the direction from green toward pink. . . Beyond a certain 
limit, however, increasing the duration causes the color to change to white. 


For the final experiment in this paper, the surface illumination of the 
disk was made both brighter and equal to B. Pink can be produced with- 
out the aid of bright surroundings, while purple, blue, and green cannot. 
All of these experimental results, some of which were known before, 
are exceptionally interesting and provocative. The fact that they do not 
disprove the modulatory theory is only of minor importance since they 
do not disprove a number of other theories, for example, that of Forster 
(35). Actually, in science, we never prove that a theory is right, but 
only that it is not Wrong. Fry's analysis in terms of the modulation 
theory would have been a more powerful document had it shown the 
Fechnerian theory to be wrong. . 
Fry's (37) second paper in 1933 makes inquiry concerning the 
stimulated retinal area, beta, and its surround. If beta and the surround 
can be pulsed or not pulsed, there are four possible combinations: beta 
surround intermittent; beta 


is steady, surround steady; beta steady, 1 : 
intermittent, surround steady; beta intermittent, surround intermit- 


tent. If it can be shown that the color seen is dependent solely on the 
intermittence of beta, then no other facts need be invoked for an ex- 
planation. The color seen in beta can be considered to be a color con- 
trast phenomenon if the color is independent of the pulsing of beta and 

The level of the intensity of 


dependent on the pulsing of the surround. 
the surround, whether it pulses or not, can be shown to affect the colors 


seen in beta. . . 
Fig. 6b illustrates Fry's disk for this ser! 
A' indicate slits. The dotted lines indicate 


When A, B, and A' are all intermittent, A 
appears green. when A m E dye 
A f appear pink and-B appear^ Pe ae wi à 
wires Le T id then A and A’ appear pink, and v dein emi 
green. The contrast effect is non-operative. Fig. 6c indicates an ad- 
ditional stimulus pattern. A, LIP: and C are slits. When the disk is 
rotated, b is seen as yellow. If b remains steady, and A and A’ are 
intermittent, still appears yellow. If A and A’ are steady, then b 
appears blue. The yellow of b is a contrast phenomenon. Similarly, the 
i ‘ble for the colors seen 1n A and A'. Fig. 
g interaction. When 


contrast effect by b is repone le dor intrat 

6d indicates another stimu us patter 

rotated in a counterclockwise direction, A and A’ appear purplish blue, 
In a reverse direction, the colors are 


and B appears yellowish white. 
oo r with a discussi 


Fry concludes this pape t 
ing that the red seen !9 due, at least in par 


ies of experiments. A, B, and 
an illuminated background. 
and A’ appear pink and B 
and B is intermittent, then 
When A and A’ are pulsed, 


on of the Benham top, add- 
t, to chromatic aberration. 
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On a different disk, subjective red borders, when viewed through a con- 
vex lens, appear to be blue. 

Fry takes issue with Piéron's concept that the leading white sector 
is the determiner of the color. He observes that a disk shown in Fig. 
6e, when rotated, will shown green. It will still show green no matter 
how much the inner circle CBDO is rotated with respect to the rest of 
the disk. It is only when the absolute amounts of white at B and D are 
changed that the color is changed. 

Fry's (38) third paper, published in 1935, attempted to show that the 
trichromatic theory (our Fechnerian theory) was inadequate in its 
explanation of the subjective colors. He constructed a rather elaborate 
apparatus which presented the same spectral source to two fields, which 
could be observed by the subject. The intensity of one field could be 
varied by adjustment of the sector slit of an episcotister, while the in- 
tensity of the other could be varied by adjustment of the speed of ro- 
tation of another episcotister. The conditions of equivalence between 
the two fields were studied for different brightnesses of ground and for 
different wave-lengths. That the ground is important is illustrated by a 
series of curves, but their theoretical implications need not concern us 
here. 

A more important series of curves is that which designates equivalent 
brightnesses between the two fields when wavelength is varied. Ac- 
cording to Fry, the fact that the red response is relatively weak finds 
basis in how one interprets the response of the receptors. If the red 
response is due primarily to the response of the red receptor, then we 
can say that the red receptor is weak. Experimentally, he demonstrates 
that a bluish green sensation can be obtained from white light which 
contains all the colors of the spectrum, and also from white which is 
composed only of blue and green. This may give support to the concept 
of the feeble response of the red. Three facts remain unexplained. Col- 
ors other than blue green can be obtained with an intermittent white 
light. If the stimulating light is of sufficient intensity, or the exposures 
are long enough, or both, then a pink sensation results. The blue gree? 
sensation cannot be produced unless the surround is of high intensity- 
Fry concludes, "As in the case of the eye, so also in the case of the photo- 
graphic plate (15), intermittent exposure is less effective than contin- 
uous exposure for all wavelengths, and the longer waves suffer more 
than the rest in this respect." The three component theory, says Fry, 
offers no ready explanation for these facts. 

Fry's (39) fourth paper, published in 1936, makes use of an apparatu? 
similar to that described in his third paper. Fry investigated the sub- 
jective colors seen under pulsing monochromatic lights, and un id 
pulsing monochromatic lights whose saturations were reduced by 4 
non-pulsing white light. The effects of varying the wavelength and 
frequency of the pulse were investigated. The experiments are similar 
to those of Abney and Piéron, to which Fry makes no mention. 
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me m color seen is that of the hue of the wave-length of the stimu- 

P ig lig t. There is, however, à tinge of blue and violet, in the second- 

diues thought to be similar in nature to Bidwell’s ghost. When 

: uli o lower saturation (as described above) are pulsed, the second- 
ry images tend to be complementary to the primary stimulus. 


" m and Dallenbach (31) presented a most important contribution 

39. They proposed to study the Luckiesh-Moss stimulus pattern 
(see Fig. 5e) to determine the effect of knowledge and instruction, color 
of the pattern and color of the background, the width of lines and the 


spacings, and eye movements on the subjective colors. 

e subjective colors were shown 
f instructions. One group was 
as told to use far fixation, 
ledge of the phenomenon 
guarantees that they will be 
w them. Phenomenal move- 


Observers with and without knowledge of th 
h three different types o: 


ment was reported by very many 
ing band and zig-za£ lines, move 
shimmer. There is no re 
movement. 

The pattern was printed in red, yellow, green blue, black, and white upon 
red, yellow, green, blue, black, neutral gray, and white backgrounds. It was 
also printed upon various stocks of paper. T he following combinations were 
found to give the optimum colors: black lines on white background, blue on 
Wite, green on white, black on green, black on yellow, and green on yellow. 

he best of these is black on white. 

Narrow lines and narrow spaces give saturated 
spaces, or the reverse, give less saturated colors. 
give no color at all. . 

When the stimulus pattern is placed in the Dodge tachistoscope, and the 
exposure is very short, nO colors are seen. An exposure of very much greater 
length than that required for eve mov! i essary before the subjective 
ta are observed. It follows, that although ey’ ements are a necessary 
actor, they are not sufficient or compulsory- 


T. (1), a student of Dallenbach, 
stimt Her problem was to observe the 
The ss pattern under monochromatic a 
tinu ollowing light sources 
492 e spectrum; a mercur 

, 436, and 405 millimicrons; 
Most exclusively the 577 and 579 millimicron 
nae a green filter which transmitted the 546 lin 
at Poli filter, used with the continuous Sources 

millimicrons. Abel writes: 


hues; narrow lines and wide 
Broad lines and wide spaces 


d these researches in 
e colors of the same 
omatic illumination. 
were used: à tungsten lamp radiating à con- 

ith its lines at 577, 579, 546, 


y arc lamp W! j 
ber filter which transmitted al- 


lines from the mercury 
e of the mercury arc; 


with a sharp cut off 


continue 
subjectiv 
nd polychr 
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The main results may be classified in the following manner: (\) wore 
tion among the black diagonal lines both of hue and of brightness. bx I 
in the interspaces between the black lines either filling in all the space be A 
the lines or merely along the edges of the lines. (3) Pattern lines diagor ied 
across the parallel black lines. Sometimes these patterns were black ar g ay 
but more often colored pastel shades. (4) When colored phenomena were e 
ported, continual change of the color was observed generally in the direction 
an increase in saturation. 


Under all conditions, both with polychromatic and monochromatic 
lights, with the exception of monochromatic red, many subjective colors 
were reported. In polychromatic light, the colors were usually more 
saturated than they appeared in monochromatic light. The colors seen 
were not necessarily restricted to the after image of the physical color. 
With red light, only various shades of red are scen. These results are 
in direct opposition to those reported by Fry. 

Segal (67), a student of Piéron, in 1940, independently 
the modulation theory of Troland, but did not mention ei 
or Fry. He too points out that the sub 
incongruous with the theory. 

Gebhard (40) contributed the mos 
tion in 1943. He utilized the left sid 


hypothesized 
ther Troland 
jective color appearances are not 


gure and ground. For both this 
figure and ground, it was possible to vary the luminosities independ- 


ently, the frequencies of intermittence, the on/off ratios, and the phase 
relationships. Thus was allowed a great variety of experimental con- 


ditions. Three kinds of illumination were used: a white continuous 
source, a Wratten red filter which transmitted nothing below 650 mil- 
limicrons, and a filter combination for wave-lengths at 560 millimicrons- 

Observations were made with w 
ous frequencies and conditions whe 
field, and vice versa. For two of th 


Produced no subjective sensation, 
characteristic 


n ical 

s tributable to the physica 

i eid poet. stimulus. aw a multitude of colors: 

easter a * i nard suggests, that the difference was due to the observer $ 
stigmatism, is led to the Second 

The boundary betw. problem; 


employing vari 
se lead over be 
the red and green stimuli 


he bars." This effect is illus 
€ wider the bars, the greater the number of zones whic 
are produced. When the illumination was reduced, two of the three subject? 
could not see zones. Practice and fati 
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Geld color effects. Geometrical patterns are mentioned again, and these may 


be the same as Fechner's hexagonal cells. 


Gebhard attempts to show the inadequacy of the Fechnerian theory 


by saying, "Colors are producible under various conditions of stimula- 
tion with monochromatic red illumination so chosen as to act upon the 
R retinal process only.” One wonders why Gebhard assumes that he 
is stimulating one retinal process with a filter transmitting 650 milli- 
microns and above. The further inadequacy is shown by, ‘‘The slow 
arousal of the zoning effect with blurring and the general instability re- 
flected in long term fluctuation of the zones.” Gebhard continues, 
“it is doubtful if the modulation hypothesis is sufficiently compre- 
hensive to deal with the formation of multiple boundaries, the inter- 
mittent and steady field effects, the instability of the colors and zones 
and the long term developmental effects." 


Finally, Brown and Gebhard (18) in 194 
h were of a binocular nature, 


8 reported several new color 
phenomena, some whic which may be re- 


lated to the subjective colors. 

The present paper has mad 
direct measurement of the rise an 
cellent survey of this material may 


he literature on the 


e no attempt to cover t 
An ex- 


d fall of the color receptors. 
be found in Bills (14). 


VIII. SUMMARY 
m subjective 


e Prevost-Fechner-Benha 
e criterion for the determination of the 


Prevost (62) in 1826. 


1. It is suggested that th 
colors may be used as an outsid 


axes in the color space. 

2. The subjective colors were discovered by 
They were rediscovered by Fechner (33), John Smith (73, 74), Brewster 
(16), F. J. Smith (72), Hannay (42), Stewart (76), Charpentier (21), 
Benham (84), Ives (46, 47), Klopsteg (49), Way (82), and Skinner 


(i) . . 
e seen in any white light which can be 


Such colors have been 


reported by Prevost 
(42), Stewart (76, 77), Charpentier (21, 22, 
(57), and Fry (39, 40). 

The colors are most easily seen on rotating black and white disks 
of various designs. Such disks (or designs on kymograph drums) have 
been presented by Fechner (33), Dove (26, 27), John Smith (73, 74), 
Helmholtz (43), Briicke (19), Benham (84), Finnegan and Moore (34), 
Hurst (45), Wolf (85), Baumann D (25), Edridge-Green 

der (24). 


(30), Kormann (50), and Colenbran . 
Colors have been obtained from sectored disks by Rood (63), 


Brewster (16), La Nature (41), Bidwell (11), and Fry (36, 37). 
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The main results may be classified in the following manner: (1) Differentia- 
tion among the black diagonal lines both of hue and of brightness. (2) Colors 
in the interspaces between the black lines either filling in all the space between 
the lines or merely along the edges of the lines. (3) Pattern lines diagonally 
across the parallel black lines. Sometimes these patterns were black or gray 
but more often colored pastel shades. (4) When colored phenomena were re- 
ported, continual change of the color was observed generally in the direction of 
an increase in saturation. 


Under all conditions, both with polychromatic and monochromatic 
lights, with the exception of monochromatic red, many subjective colors 
were reported. In polychromatic light, the colors were usually more 
saturated than they appeared in monochromatic light. The colors seen 
were not necessarily restricted to the after image of the physical color. 
With red light, only various shades of red are seen. These results are 
in direct opposition to those reported by Fry. 

Segal (67), a student of Piéron, in 1940, independently hypothesized 
the modulation theory of Troland, but did not mention either Troland 
or Fry. He too points out that the subjective color appearances are not 
incongruous with the theory. 

Gebhard (40) contributed the most recent, thorough communica- 
tion in 1943. He utilized the left side of an apparatus described by 
Brown (17). The subject observes a figure and ground. For both this 
figure and ground, it was possible to vary the luminosities independ- 
ently, the frequencies of intermittence, the on/off ratios, and the phase 
relationships. Thus was allowed a great variety of experimental con- 
ditions. Three kinds of illumination were used: a white continuous 
source, a Wratten red filter which transmitted nothing below 650 mil- 
limicrons, and a filter combination for wave-lengths at 560 millimicrons- 
M ea Dd aide cet E asi e 
field, and vice versa. For two of fin ttes Lose y hende ed : m 

wde ervers, the red and green stimu 
produced no subjective sensations other than those attributable to the physical 


characteristics of the stimulus. The third observer saw a multitude of colors- 
It is possible, Gebhard suggests, that the difference was due to the observer's 
greater astigmatism. This led to the second problem. 

The boundary between the field and ground was made indistinct. Under 
these conditions all observers saw colors—some where they had not seen them 
before. The field was now made into a rectangular bar whose width could be 
varied. Now, “the area of the figure and the part of the field immediately 
adjacent to it, was found to divide itself symmetrically into a series of zones Of 
bands parallel to the two lateral boundaries of the bars." This effect is illus- 
trated in Fig. 6f. The wider the bars, the greater the number of zones which 
are produced. When the illumination was reduced, two of the three subject? 
could not see zones. Practice and fatigue are necessary for the optimum pet 
ception of the zones. Simultaneous with the zoning effect, there were many 
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field color effects. Geometrical patterns are mentioned again, and these may 


be the same as Fechner's hexagonal cells. 


Gebhard attempts to show the inadequacy of the Fechnerian theory 
by saying, ''Colors are producible under various conditions of stimula- 
tion with monochromatic red illumination so chosen as to act upon the 
R retinal process only." One wonders why Gebhard assumes that he 
is stimulating one retinal process with a filter transmitting 650 milli- 
microns and above. The further inadequacy is shown by, ‘The slow 
arousal of the zoning effect with blurring and the general instability re- 
flected in long term fluctuation of the zones." Gebhard continues, 
“| it is doubtful if the modulation hypothesis is sufficiently compre- 
hensive to deal with the formation of multiple boundaries, the inter- 
mittent and steady field effects, the instability of the colors and zones 
and the long term developmental effects.” 

Finally, Brown and Gebhard (18) in 1948 reported several new color 
phenomena, some which were of a binocular nature, which may be re- 


lated to the subjective colors. . 
The present paper has made no attempt to cover the literature on the 
direct measurement of the rise and fall of the color receptors. An ex- 


cellent survey of this material may be found in Bills (14). 


VIII. SUMMARY 

1. It is suggested that the Prevost-Fechner-Benham subjective 
colors may be used as an outside criterion for the determination of the 
Prevost (62) in 1826. 
h (73, 74), Brewster 
Stewart (76); Charpentier (21), 
(82), and Skinner 


2. The subjective colors were 
They were rediscovered by 
(16), F. J. Smith (72), Hannay (42), 
um (84), Ives (46, 47), Klopsteg (49), Way 
1 
3. Subjective colors may be seen in any white light which can be 
made to pulse below the fusion frequency: Such colors have been 
reported by Prevost (62), J. Smith (73, 74), F. J. Smith (72), Hannay 
A Stewart (76, 77), Charp 23), Bidwell (11), Percival 
, and Fry i : . 
The sahara we easily seen on rotating black and white disks 
of various designs. Such disks (or designs 07 kymograph drums) have 
een presented by Fechner (33), Dove (26, 27), John Smith (73, 74), 
Helmholtz (43), Brücke (19), Benham (84), Finnegan and Moore (34), 
Hurst (45), Wolf (85), Baumann (6), Doniselli (25), Edridge-Green 
(30), Kormann (50), and Colenbrander (24). . Rood (63 
Colors have been obtained from sectored disks by Roo (63), 
Brewster (16), La Nature (41), Bidwell (11), and Fry (36, 37). 
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The main results may be classified in the following manner: (1) y joe 
tion among the black diagonal lines both of hue and of brightness. (2) Goldie 
in the interspaces between the black lines either filling in all the space wate 
the lines or merely along the edges of the lines. (3) Pattern lines diagonally 
across the parallel black lines. Sometimes these patterns were black or gray 
but more often colored pastel shades. (4) When colored phenomena were E 
ported, continual change of the color was observed generally in the direction 0 
an increase in saturation. 


Under all conditions, both with polychromatic and monochromatic 
lights, with the exception of monochromatic red, many subjective colors 
were reported. In polychromatic light, the colors were usually more 
saturated than they appeared in monochromatic light. The colors seen 
were not necessarily restricted to the after image of the physical color. 
With red light, only various shades of red are seen. These results are 
in direct opposition to those reported by Fry. 

Segal (67), a student of Piéron, in 1940, independently hypothesized 
the modulation theory of Troland, but did not mention either Troland 
or Fry. He too points out that the subjective color appearances are not 
incongruous with the theory. 

Gebhard (40) contributed the most recent, thorough communica- 
tion in 1943. He utilized the left side of an apparatus described by 
Brown (17). The subject observes a figure and ground. For both this 
figure and ground, it was possible to vary the luminosities independ- 
ently, the frequencies of intermittence, the on/off ratios, and the phase 
relationships. Thus was allowed a great variety of experimental con- 
ditions. Three kinds of illumination were used: a white continuous 
source, a Wratten red filter which transmitted nothing below 650 mil- 
limicrons, and a filter combination for wave-lengths at 560 millimicrons- 

Observations were made with white, 
ous frequencies and conditions whereby 


field, and vice versa. For two of the thr 
produced no subjective sensations other 
characteristics of the stimulus, The third observer saw a multitude of colors. 
It is possible, Gebhard suggests, that the difference was due to the observer's 
greater astigmatism. This led to the second problem. 

The boundary between the field and ground was made indistinct. Under 
these conditions all observers saw colors—some where they had not seen them 
before. The field was now made into a rectangular bar whose width could be 
varied. Now, “the area of the figure and the part of the field immediately 
adjacent to it, was found to divide itself symmetrically into a series of zones OT 
bands parallel to the two lateral boundaries of the bars." This effect is illus- 
trated in Fig. 6f. The wider the bars, the greater the number of zones which 
are produced. When the illumination was reduced, two of the three subjects 
could not see zones. Practice and fatigue are necessary for the optimum pet 
ception of the zones. Simultaneous with the zoning effect, there were many 


red, and green lights, employing vat! 
the ground had a phase lead over the 
ee observers, the red and green stimuli 
than those attributable to the physical 


| 


PREVOST-FECHNER-BENHAM SUBJECTIVE COLORS 131 


field color effects. Geometrical patterns are mentioned again, and these may 


be the same as Fechner's hexagonal cells. 
the inadequacy of the Fechnerian theory 
ble under various conditions of stimula- 
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R retinal process only." One wonders why Gebhard assumes that he 
1s stimulating one retinal process with a filter transmitting 650 milli- 
microns and above. The further inadequacy is shown by, "The slow 
arousal of the zoning effect with blurring and the general instability re- 
flected in long term fluctuation of the zones." Gebhard continues, 
“it is doubtful if the modulation hypothesis is sufficiently compre- 
hensive to deal with the formation of multiple boundaries, the inter- 
mittent and steady field effects, the instability of the colors and zones 
and the long term developmental effects.” 


Finally, Brown and Gebhard (18) in 1948 reported several new color 
ere of a binocular nature, which may be re- 
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VIII. SUMMARY 
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The on ies easily seen on rotating black and white disks 
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been presented by Fechner (33), Dove (26, 27), John Smith (73, 74), 
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Colors from black and white pictures, when viewed under a pulsing 
light, have been reported by Stewart (77). 

Colors from stationary patterns, due to eye movements, have been 
reported by Skinner (71) and Luckiesh and Moss (54). 

4. Almost all investigators report that the type and level of illu- 
mination, the speed of rotation or frequency of pulses, the design or dis- 
tribution of black, white, and lines, and the state of the eye affect the 
colors seen. 

Movement of some kind appears to be a necessary factor. That 
brief tachistoscopic exposure shows no colors at all is reported by Pauli 
and Wenzl (56) and Erb and Dallenbach (31). 

That practice aids in the perception of the colors is reported by 
Helmholtz (43), Aubert (4), Gebhard (40), but Bagley (5) insists that 
practice is not important. Erb and Dallenbach (31) show that knowl- 
edge of the colors facilitates their observation. 

That there are differences in the colors seen in direct and indirect 
vision is reported by Aubert (4), Exner (32), and Pauli and Wenzl (56). 

Stewart (77) reports that color blind individuals do not see the colors 
with the same clarity as normal individuals. 

Induction phenomena (subjective colors induced by the stimulation 
of an adjacent retinal area) are reported by Bagley (5), Piéron (58, 
59, 60, 61), Pauli and Wenzl (56), Fry (37), and Gebhard (40). 

5. A hexagonal cell structure, seen with th 
been reported by Fechner (33), Brewster (16), 
(4), Exner (32), Edridge-Green (29), and Gebhard (40). 

6. Subjective colors from non-w 
Those who reported subjective col 


Bie nerve is supported by mg 
tt 3), F 
Segal (67). ews (3), Fry (36, 37, 38, 39), an 
Other theories of a physiological natur 
Prevost (62), N. Smith (75) Charpentier (22), Gra: 
à , , y (86), La Nature (41), 
Henry (44), Bidwell (11), Bagley (5), Doniselli (25), Baumann (6, 7, 8), 


Edridge-Green (30), Ives (46, 47), Piéron (58, 59, 60, 61), Forster 
(35), Pauli and Wenz! (56), and Colenbrander (24) ^ °F? 


€ have been suggested by 
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Other theories of a physical nature have been proposed by J. Smith 


(73, 74), F. J. Smith (72), Benham (9), 


and Schwartze (66). 


Discussion 


This paper has outlined th 
may be that one day these same 
color vision—the specification oft 


e long history of subjective colors. It 
colors may solve the great enigma of 
he receptors of the eye. 


Throughout the history of physiological optics the eye has never 


resolved a stimulus into its comp 
conditions. The receptors cou 
the eye consistently int 


to perceive it as an unanalyzed sensation. 
hanism—since it fulfills this ideal even under the 


ntal conditions, with but a single exception. 
the eye can be made to yield its secret. 
to the differential rise and fall of the re- 


the most elaborate mec 
most elaborate experime 

But it is just possible that 
If the subjective colors are due 


ceptors, then here at last the eye is not so p 
stimulating wave 


lors is true, then w 
find colors which when pulse 


completely integrating the 
theory of the subjective co 
color space until we 


[3 When such colors are found, then we are 
it be that the receptors are imaginar 


data may be taken on the real colors, 
ds to no physical stimulus. 
g the subjective color 
would have to be carefully controlled. 


space which correspon 
perimental variables affectin 
discussed at length in this paper, 


onents under any other experimental 
ld never be determined directly because 
egrates the many wave-lengths of the stimulus 


This integration must require 


erfect after all, for it is not 
lengths. If the Fechnerian 
e need but explore the 
d, show no subjective 
stimulating one receptor. 
y, as is probably the case, then 
and extrapolated into the color 

Of course, the ex- 
s, which have been 


BIBLIOGRAPHY 


1. Aner, T. M. ‘Subjective’ colors from 
line-patterns viewed in polychro- 


matic and monochromatic light. 


Amer, J. Psychol., 1939 52, 610- 
615. 

2. Apney, W. pe W. The artificial spec- 
trum top. Nature, Lond., 1894, 51, 
292. 

3. Appian, E. D., & Martaews, R. The 
action of light on the eye. 
Physiol, 1927, 63, 318-414; 1927, 

"TW 64, 279—301; 1928, 65, 273-298. 

- Aupert, H. Physiologie der Netzhaut. 
Breslau: E. Morgenstern für Aug. 
Schultz und Co., 1865. P. 355 and 
pp. 377-380. 


5. BAGLEY, F. w. An investigation of 
Fechner's colors. Amer. J. Psychol., 


1902, 13, 488-525. 

6. BAUMANN, C. Beiträge zur Physiolo- 
gie des Sehens: IV. Subjektive 
Farbenerscheinungen. Pftüg. Arch. 
ges. Physiol., 1912, 146, 543-552. 

7. BAUMANN, C. Beiträge zur Physiolo- 
gie des Sehens: V. Subjektive Far- 
benerscheinungen. Pflüg. Arch. ges. 
Physiol., 1916, 166, 212-216. 

8. BAUMANN, C. Beitrige zur Physiolo- 
gie des Sehens: VII. Subjektive 
Farbenerscheinungen- Subjektives 
und objektives Empfinden. Pflüg. 


134 


Arch. ges. Physiol., 1918, 171, 496- 
499. 

9. BENHAM, C. E. The artificial spectrum 
top. Nature, Lond., 1894, 51, 200. 

10. BENHAM, C. E. The artificial spec- 
trum top. Nature, Lond., 1895, 51, 
321. 

11. BIDWELL, S. On subjective colour phe- 
nomena attending sudden changes 
in illumination. Proc. roy. Soc., 
1896, 60, 368-377. 

12. BIDWELL, S. On negative after-images 
following brief retinal extinction. 
Proc. roy. Soc., 1897, 61, 268-271. 

13. BIDWELL, S. Some curiosities of vision. 
Notices of the Proceedings at the 
Meetings of the Royal Institution of 
Great Britain, 1896-98, 15, 354. 

14. Brits, M. A. The lag of visual sensa- 
tion in its relation to wave-lengths 
and intensity of light. Psychol. 
Monogr., 1920, 18, No. 5. 

15. Brarr, J. M., & Hyran, M. C. The 
intermittency effect in photographic 
exposure. J. opt. Soc. Amer., 1933, 
23, 353-358. 

16. Brewster, D. On certain affection of 
the retina. London, Edinburgh, and 
Dublin Philosophical Magazine and 
Journal of Science, 1861, 21 (4th 
series), 20-24. 

17. Brown, C. R. A ‘universal’ apparatus 
for research in physiological optics, 
J. exp. Psychol., 1943, 33, 340-349. 

18. Brown, C. R. & GEBHARD, J. W. 
Visual field articulation in the ab- 
sence of spatial stimulus gradients. 
J. exp. Psychol., 1948, 38, 188-200. 

19. Brücke, E. Über die Nutzeffect inter- 
mitterender Netzhautreizungen, 
Sitzungsberichte der Mathematisch- 
Naturwissenschaftlichen Classe der 
Kaiserlichen Akademie der Wissen- 
schaften, 1864, 49 (Abtheilung 2). 
128-153. 

20. C(ATTELL), J. McK. The spectrum 
top. Science, 1895, 2 (new series), 
13. 

21. CHARPENTIER, A. Phénomènes de 
coloration apparente observés sous 


23: 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


. CHARPENTIER, A. 


JOZEF COHEN AND DONALD A. GORDON 


l'influence d'excitations lumineuses 

instantanées. C. R. Soc. Biol. Paris, 

1891, 3 (9 série), 596-599. 

Remarques et ex- 
périences au sujet de la coloration 
entopique des lumières blanches in- 
stantées. C. R. Soc. Biol. Paris, 1891, 
3 (9 série), 601-604. 

CHARPENTIER, A. Isolement des 
couleurs dans la luminiére blanche 
par leur action successive. C. R. 
Soc. Biol. Paris, 1892, 4 (9 série), 
533-536. 

COLENBRANDER, M. C. Uber sub- 
jektive Farbenerscheinungen. Acta 
ophthal., Kbh., 1933, 11, 437-460. 

DowisELLI, C. Sui fenomeni d'indu- 
zione cromatica da luce bianca e 
sulla natura dei processi consecu- 
tivi. Arch. Fisiol., 1907, 4, 561-593. 

Dove, H. W. Ueber Scheiben zur 
Darstellung subjectiver Farben. 
Pogg. Ann. Physik u. Chemie, 1848, 
85 (151) 526-528. 

Dove, H. W. Darstellung der. Farben- 
lehre und Optische Studien. Berlin: 
G. W. F. Müller, 1853, Pp. 281-283. 

DuBois, R. Recherches expérimen- 
tales sur le róle de la contractilité 
dans les mécanismes sensoriels chez 
les mollusques. J, de Psychol., 1922, 
17, 787-805. 

EpRIDGE-GREEN, F. W. The artificial 
Spectrum top. Nature, Lond., 1895, 
51, 321. 

EprIDGE-GREEN, F. W. Some sub- 
jective phenomena of vision. J. 
Physiol., 1915-16, 50, xl. 

Ere, M. B, & DarLENnBACH, K. M. 
'Subjective' colors from line-pat- 
terns. Amer. J. Psychol., 1939, 52, 
221-241. 

Exner, S. Bemerkungen über inter- 
mittirende Netzhautreizung. Pfliig. 
arch. ges. Physiol., 1870, 3, 214-240. 

FEcHNER, G. T. Ueber eine Scheibe 
Zur Erzeugung subjectiver Farben. 
Pogg. Ann. Physik. u. Chemie, 1838, 
45 (121), 227-232, 

FINNEGAN, J. M., & Moore, B. The 


35. 


36. 


37. 


38. 


39. 


40. 


Al. 


42. 


43. 


44. 


45. 


46. 


4T. 


PREVOST-FECHNER-BENHAM SUBJECTIVE COLORS 


artificial spectrum top. Nature, 
Lond., 1895, 51, 292-293. 

Forster, G. La théorie dynamique de 
la vision des couleurs. Année 
psychol., 1923, 24, 26-69. 

Fry, G. A. Modulation of the optic 
nerve-current as a basis for color- 
vision. Amer. J. Psychol., 1933, 45, 
488-492. 

Fry, G. A. Color phenomena from ad- 
jacent retinal areas for different 
temporal patterns of intermittent 
white light. Amer- J. Psychol., 1933, 
45, 714-721. 

Fry, G. A. Color sensations produced 
by intermittent white light and the 
three-component theory of color- 
vision. Amer. J. Psychol. 1935, 47, 
464-469. 

Fry, G. A. Color sensations produced 
by intermittent spectral stimuli. 
Amer. J. Psychol., 1936, 48, 326-330. 

GEBHARD, J. W. Chromatic phe- 
nomena produced by intermittent 
stimulation of the retina. J. exp. 
Psychol., 1943, 33, 387-406. 

H., E. Expériences sur les transforma- 
tions subjectives des couleurs. La 
Nature, 1897, No. 1268, 256. 

Hannay, J. B- Colour perception. 
Nature, Lond., 1881-82, 25, 604-605. 

HeLMHOLTZ, H.v. Treatise on physio- 
logical optics. (Vol. II.) (3rd ed.) 
(Ed., J. P. C Southall). Menasha, 
Wis.; George Banta Publishing Co. 
for the Optical Society of America, 
1924. Pp. 255-261. 

Henry, C. Applicatio 
métrie et à l'ophtalmologie d'un 
mode de production, jusqu'ici i 
expliqué, de la couleur. C- 
Sci., Paris, 1896, 122, 406-408. 

Hunsr, C. H. A spectrum top and a 
complementary One- Nature, Lond. 
1895, 51, 510. 

Ives, H. E. Visual diffusivity. Philos. 


Magazine, 1917, 33 (series 6), 18-33. 
Ives, H. E. The resolution of mixed 
ialvisual diffusiv- 


colours by differenti 


48. 


49. 


50. 


55. 


56. 


57. 


58. 


59. 


. LEMMON, V. W. 


. LUCKIESH, 


135 


ity. Philos. Magazine, 1918, 35 
(series 6), 413-421. 

Jupp, D. B. Color vision. In O. 
Glasser (Ed.), Medical Physics. 
Chicago: Year Book Pub., 1944. 
Pp. 265-282. 

KLorsreo, P. E. A curious color phe- 
nomenon. Science, 1931, 73 (new 
series), 590. 

Kormany, F. Die Erzeugung eines 
kontinuierlichen Spektrums auf 
Grund der Benhamschen Farben. Z. 
Sinnesphysiol., 1931, 62, 158-166. 

Entoptic colors. 

Science, 1932, 75 (new series), 217. 


. LIVEING, GD. On Benham's arti- 


ficial spectrum top. Nature, Lond., 
1894, 51, 167. 


. LivrixG, G. D. Theartificial spectrum 


top. Nature, Lond., 1894, 51, 200. 
M., & Moss, F. K. A 
demonstrational test of vision. 
Amer. J. Psychol., 1933, 45, 135-139. 

Newton & Co. The artificial spec- 
trum top. Nature, Lond., 1895, 51, 
463. 

pautt, R, & WENZL, A. Uber Far- 
benempfindungen bei intermittier- 
endem farblosen Lichte. Arch. ges. 
Psychol., 1924, 48, 470-484. 

PERCIVAL, A.S. Colour phenomena due 
to intermittent stimulation with 
light: note on the colours of Ben- 
ham's top. Trans. ophthal. Soc. 
U.K., 1909, 29, 119-125. 

préron, H. Le méchanisme d'appari- 
tion des couleurs subjectives de 
Fechner-Benham. Année psychol., 
1922, 23, 1-49. 

Péron, H. Des lois du déséquilibre 
chromatique initial et de la pré- 
pondérance de la diffusion chro- 
matique dans l'excitation lumi- 
neuse de la rétine. (Méchanisme de 
production des couleurs subjectives 
de Fechner-Benham). C. R. Soc. 
Biol. Paris, 1922, 86, 922-925. 

Péron, H. Le mécanisme des 
couleurs subjectives de Fechner- 


136 JOZEF COHEN AND DONALD A. GORDON 


Benham. J. de Psychol., 1923, 20, 
75-80. 

61. Pifron, H. Influence de la compo- 
sition de la luminiére sur la nature 
des couleurs subjectives de Fechner- 
Benham. Données complémen- 
taires. Année psychol., 1928, 29, 
229-233. 

62. Prevost, B. Sur une apparence de 
décomposition de la lumiére blanche 
par le mouvement du corps qui la 
réfléchit. Mémoires de la Société de 
Physique et d'Histoire naturelle de 
Genéva, 1823-26, 3, 121. 

63. Roop, O. N. Ona new theory of light, 
proposed by John Smith. Amer. J. 
Sci. and Arts, 1860, 30 (2nd series), 
182-186. 

64. R(oop), O. N. Subjective optical phe- 
nomena, Amer. J. Sci. and Arts, 
1861, 31 (2nd series), 417. 

65. SANFORD, E. C. A laboratory course in 
physiological psychology: IV. Amer. 
J. Psychol., 1893, 5, 390-415. 

66. SCHWARTZE, T. Grundgesetze der 
Molekularphysik. Leipzig: J. J. 
Weber, 1896. Pp. 200-206. 

67. SEGAL, J. Une nouvelle manifestation 
de couleurs subjectives. C. R. Soc. 
Biol. Paris, 1940, 133, 259-261. 

68. Sexton, F. P. The spectrum top. 
Phys. Soc. Lond., 1907-08, 21, 392- 
395. 

69. SILBERSTEIN, L. Investigations on the 
intrinsic properties of the color do- 
main. J. opt. Soc. Amer.,*1938, 28, 
63-85. 

70. SILBERSTEIN, L. Investigations on the 
intrinsic properties of the color do- 
main. II. J. opt. Soc. Amer., 1943, 
33, 1-10. 

71. SKINNER, B. F. A paradoxical color 
effect. J. gen. Psychol., 1932, 7, 481- 
482. 

72. Situ, F. J. Apparent decomposition 
of sunlight by intermittent reflecting 


Received. July 14, 1948. 


73. 


74. 


75. 


76. 


Ths 


78. 


79. 


80. 


81. 


82. 


83. 


84, 


85. 


86. 


surfaces. Nature, Lond., 1881, 24, 
140. 

Satu, J. On the production of colour 
and the theory of light. Rep. Brit. 
Ass. Adv. Sci. (Notices and Ab- 
stracts), 1859, 29, 22-23. 

Smita, J. On the cause of colour and 
the theory of light. Amer. J. Sci.and 
Arts, 1860, 29 (2nd series), 276-278. 

Surrn, N. Colour perception. Nature, 
Lond., 1882, 26, 30-31. 

Stewart, G. N. Is the law of Talbot 
true for very rapidly intermittent 
light? Proc. roy. Soc. Edinb., 1887- 
88, 15, 441-464. 

Stewart, G. N. Color phenomena 
caused by intermittent stimulation 
with white light. Amer. J. Physiol., 
1924, 69, 337-353. 

TROLAND, L. T. The enigma of color 
vision. Amer. J. physiol. Opt., 1921, 
2, 23-48. 

TROLAND, L. T. The Principles of 
Psychophysiology (Vol. I). New 
York: D. van Nostrand Co., 1930. 
P; 250; 

Turner, D. The spectrum top. 
Nature, Lond., 1895, 51, 438. 

Von Kries, J. In H.v. Helmholtz, 
Treatise on physiological optics. 
(Vol. II) (3rd él) (Ed. J. E. C. 
Southall). Menasha, Wis., George 
Banta Publishing Co. for the Optical 
Society of America, 1924, Pp. 449- 
450. 

Way, E. F. Entoptic colors. Science, 
1932, 75 (new series), 81. 

WEYL, H. Mind and nature. Phila- 
delphia: University of Pennsyl- 
vania Press, 1934, Pp. 1-19. 

Anon. An “artificial spectrum top." 
Nature, Lond., 1894, 51, 113-114. 
Anon. An artificial spectrum. Scien- 
tific American, 1895, 72, 309-310. 
Anon. The “artificial spectrum” or 


new colour top. Engineering, 1895, 
59, 381. 


AN EXTENSION OF CONTROL GROUP DESIGN 


RICHARD L. SOLOMON! 
Harvard University 


. d elaboration of existing experimental desi 
ml i an experimental group and a control group are he 
peciall ri t is intended asa recommendation to psychologists, es- 
aoe ose working in these specific fields: (1) transfer of training 
nobles ents; (2) experiments on induced changes in existing attitudes, 
ici gin personal values; and (3) experiments on the effects of 
Ekiti et experience on responses, skills, and performances already 
iiis. e in the behavior repertoire. We feel that the use of the recom- 
ain e design and procedures will provide information of a valuable 
, information which cannot be obtained with the use of existing 


standard research designs. 
BRIEF HISTORY 


This paper presents an 


U The history of the idea ofa controlled experiment is a long one. 
bs mia one goes back to J: S. Mill's canons (10) for the concept of ex- 
RR controls. "Those readers in in the various types of 
iter s which have been suggested in the history of psychological re- 
(17 may be referred to Fechner (6), Henri (9), Wundt (21), Titchener 
), Baldwin (1), and Warren (18). However, if one is interested solely 

y: Twentieth 


1 Group in the history of psycholog 
i We have not been able to 


C 3 

entury psychology contains t 

of control group design, as we use it today, 
have awaited the 


before the year 1901. Control group design seems to 


development of statistical concepts which allow for the characterization 
s of measures of central tendency; and 


stical sophistica- 
with the fact that the 


Psychologists seem to have been slo 
control group in the three types of 


tion wi : i 
use I experimental design. We 
of an experimental group and a 


exper} : 
periments cited above is a very recent development. The earliest use 
esign in the study of transfer of training 


of this k; 
is han kind of experimental d 
und in the work of Thorndike and Woodworth (16), published in 
C. F. Mosteller and Professor 


rT 
he author wishes to express appreciati i ia ci 
per. rofessor i5. 


S.A 
Bo Stouffer for many original and helpful à 
3 Ami kindly contributed many interesting historical notes. 
versity. is research was facilitated by the Laboratory 
137 


Harvard Uni 
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1901. These investigators used independent educational groups as con- 
trols on other groups. They were studying the influence of various 
educational sequences on proficiency in various subjects. In their ob- 
servations a sequence of training such as ABC could be evaluated 
against a control group with sequence ADC. They attributed any dif- 
ferences in performance in C in terms of differential effects of B and D. 
Thorndike and Woodworth were examining educational sequences as 
they found them. In the actual experimental manipulation of groups, 
in transfer of training experiments, it is interesting that control group 
methodology was not being used at the time that Thorndike and 
Woodworth published their accounts. For example, we find a typical 
experiment in transfer of training by Ebert and Meumann (4) in 
1904 in which one group was used, the typical experimental group 
receiving this sequence; pre-test, training, post-test. This procedure 
was criticized by Dearborn (3) in 1909, and by Sleight (14) in 1911. 


SY goes, we fi i ‘ , : 
Ewert (5) and Bray 8 ve find that in the middle 'twenties 


; e term “Control Group” for the 
group that received no training. Warren included this term in his 


Dictionary (18) in 1934, „Yet Harriman's Tecent New Dictionary (8) 
did not include it. If one is still skeptical about the relative newness of 


The idea of co-twin control and litter-mate c 
ontrol tered be- 
tween 1930 and 1940. li ii 


— i 


s 
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on changes of attitudes, opinions, and 
n much advantage of the lessons in ex- 
from transfer of training experiments. 
to give a group a pre-test, using an 
acceptable attitude scale, then subject the group to educational pro- 
cedures of some sort, and then post-test the group with the same test 
or an equivalent form of it. Without a control group it is impossible to 
evaluate cither the effects of the educational procedure or the influence 
of taking the attitude scale. The experiment of Peterson and Thurstone 
(12) on the effects of motion pictures upon the attitudes of children 
he first serious attempt to use control groups for the pur- 
s of the educational procedures. This was 
and Newcomb (11, pp. 946-979) 


The experimental studies 
personal values have not take 
perimental design stemming 
The common procedure has been 


represented t 
poses of evaluating the effect: 


published in 1933. Murphy, Murphy, 
in their summary of experiments on the modification of attitudes, listed 


many recent experiments where no control group was used. 

In general, the arguments for the use of a control group apply in 
the same way to the three fields of endeavor listed in our Introduction. 
Woodworth (20) has included an exhaustive account of experimental 
design in his discussion of transfer of training experiments. More re- 
cently, Gagné, Foster, and Crowley (7) have reviewed this problem of 
experimental design in the measurement of transfer effects, Many of 
their conclusions apply to experiments on induced attitude changes and 
the acquisition of skills. To review their recent article is unnecessary 
the designs for experiments are thoroughly outlined there. The impor- 
tant point we wish to make is that their recommendations involved the 
use of an experimental group which receives controlled restitit pad 
a control group which does not. For the three types of pa parT is 
shall discuss, the current standard designs may be tabulated as follows: 


1 ining: 
(1) Transfer of training Experimental Control 
Group Group 
es 
Pre-test on Task: dm 2 
Training on Task: i. yes 


Post-test on Task: 
es in altitudes, opinions, or values: 


(2) Experimentally induced chang 
Experimental Control 
Group Group 
yes 
Pre-test on Attitude: Bi ne 
Education, Propaganda a ya 


Post-test on Attitude: 
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(3) The effects of controlled experience on any existing behavior: 
Experimental Control 
Group Group 
Pre-test on Present Behavior, yes yes 
Experimental Treatment yes no 
Post-test on Final Behavior, yes yes 


With matched groups, or with matched tasks, both Woodworth (20) 
and Gagné, Foster, and Crowley (7) listed appropriate substitute de- 
signs, but all are based upon an experimental and a control group. It is 
understood that the two groups can be matched by one criterion or an- 
other or may be randomly drawn from the population. The pre-test 


some logical manner. It is important to state that the same scale units 
should be involved in both the pre-test and post-test. 

We have attempted to follow the history of control group methodol- 
ogy in a sketchy way, and have outlined what present practice seems 
to be. We now wish to proceed to an extension of present practice to 
what we believe is a more adequate design of experimentation. 


AN EXTENSION OF EXPERIMENTAL DESIGN 
The experimental plan outlined ab 


for analysis: (1) assignment of a numerical value for the effects of 
the pre-test as this factor operates on the post 


- This procedure is discussed by Gagné, 


EXTENSION OF CONTROL GROUP DESIGN 141 


pens in many experiments of the sorts we have discussed. We feel that 
the pre-test operates directly upon the effectiveness of training, or 
interacts with the training process. That is, there is a great possibility 
that merely taking a pre-test changes the subjects' attitudes toward the 
training procedures. Or it may conceivably change the “set” or atten- 
tional factors important to the effectiveness of training. Thirdly,.it 
may actually change the manner in which the subjects "perceive" the 
training material. These psychological possibilities may be termed ‘‘in- 
teractions.” Thus, if a group of subjects were given a pre-test on spelling 
ability and the test list contained a lot of ie and ei vowel pairs, it is 
perfectly conceivable that the subjects would “pay attention to" cer- 
tain relevant portions of a spelling lesson in a manner very different 
than if they had not received the pre-test at all. Or, in the case of at- 
titude experiments, the way that the pre-test items are worded might 
influence the manner in which the propaganda or education procedures 
are “received” by the subjects. We believe that this kind of pre-test- 
training interaction is important, and that it occurs in a large number 
of experiments of the variety we have listed above. We have found 
that these interaction effects can be quantified. The following control 


group design is necessary to do it: 
Experimental Control Control 
Group Group I Group II 
Pre-test yes yes =A 
Training, Education yes no ye 
Post-test yes yes y 


The question immediately arises concerning the anil tox 
taining an improvement (or change) score for conte gos X. p ‘ 
we believe, may be legitimately done by statistical in pes : de E 
that: (1) the groups are either reasonably matche at a = ui 
the experiment on the basis of as many criteria as poss! le or 2 nerd 
domly drawn from the population; (2) the means and em eos 
experimental group and the control group Í are known tOr pem : 
and (3) the N’s for all three groups are reliably large- This latter ge 
ification is, of course; partly dependent upon the prior pem o n 
experimenter with performances of samples on the tæk in ed T 
We take these three provisions into account, We can infer w a : pr 
ses score for control group II would = = Marii ee A "- x 

aken the pre-test. The grand mean pre-test sc } 
group an l kk. I (pooled data) represents the best pe 
of what the mean of the control grou een 


pH would have b 
group taken the pre-test. We shall call this value uj”, the inferred pre- 
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test score" for control group II. Table I outlines the numerical values 
which must be fitted into the analysis in order for the interaction effects 
to be quantified. 


TABLE I 


NUMERICAL VALUES NEEDED FOR THE ANALYsISs oF INTERACTION 
EFFECTS IN A THREE-GROUP DESIGN 


Experimental Control Control 
Group Group I Group II 
:__ ata: 

Pre-test Means 01 Oma, (P1) 02 X Om (P1) D Mj Eiaa 
Training yes (Tı) no yes (Ti) 
Post-test bi toms, bo t0mb, bs tomb, 
Improvement or Change 
in Means di —b—a, d: — b; —a; d; =b; —i 


attentional, perceptual 


with reference to Tabl i 
eiue € I what we have done is to as- 


4 consideratio. “this 2, P 
assumption that the post- ; 5 : ns here; this is equivalent to the 
ing due to its use. 
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of the improvement of the experimental group to be equal to the sum 


of the improvement scores of the two control groups is a reflection of in- 


teraction effects. These effectsare 
There are many ways to interpre 
example, if an interaction term is relative 


provement of the experimental group, and i 
at the pre-test somehow facilitated the effective use of 


es. We then might start on a search for the kinds of 
attitudinal and perceptual changes that the pre-test had produced, since 
such changes might have a generality of application for many kinds of 
educational procedures. In attitude testing, à negative I term would 
lead one to suspect that the pre-test developed some kind of resistance 
to the propaganda material. This problem then could be approached 
by new experiments. It is possible that rules.for building pre-tests 
which make subjects receptive to education might come out of such a 
program. In other words, the magnitude and direction of I ought to 


provide some hints to the experimenter. 
We have indicated the various possible psychological statuses of the 


interaction term, J. There still remains the challenging problem con- 
cerning the psychological status of training alone. It is hard to conceive 
of a change in behavior due to training alone. Life history has produced 


certain attitudes which must inevitably interact with training proce- 
is important here, together with the 


dures. The problem of familiarity i 
total personality organization. In order to treat logically the concept 
of training alone it seems necessary to postulate some prior state, A, 
which is common to all three groups at the time of the pre-test admin- 

teract with A under all 


istration. Then training may be inferred to in 
This construction enables us to state 


conditions in the experiment. c 7 
that, all other things being equal, the improvement of control group 
is due to training alone Or to the training-A interaction, piper one 
might prefer. Parenthetically it might be added that a study o x 
variance during successive stages in the training procedure might revea 
some of the training-A interactions. Much work needs to be done in 
elucidati is problem. 

jeter m artus in the listing of all of the permutations and 
combinations of outcomes possible within Table Ilet us examine a small 


demonstration experiment which we have completed in order to portray 
one use to which the three-group design may be put. 

RIMENT, THREE-GROUP DESIGN 

a fifth and a sixth grade, were used 
ch class were assigned to the three 


quantified as follows: I =dı— (da-d-d3). 
t an obtained interaction. For 
ly large compared with the im- 
f it is positive in sign, we 


might assume th 
training procedur 


A SAMPLE EXPE 


Two grammar school classes, 
in this experiment. The pupils in €a 
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groups required in the new design. The groups were roughly equated 
in spelling ability by means of teachers judgments. The experimental 
group was pre-tested on a list of words of equal difficulty. Control group 
I was pre-tested at the same time on the same list of words. Control 
group II was out of the rooms at the time. Then the experimental group 
and control group II were given a standard spelling lesson, covering 
some general rules of spelling. Control Group I was out of the class- 
rooms during this training period. Then the 
the groups. It was composed of the same words as in the pre-test. The 
results for the two grades are presented in the tables below. It is re- 
grettable that the N for each group was not larger, but in order to ad- 
minister this experiment properly it was necessary to restrict the 
sample size. If we consider the sixth grade experiment to be a replication 
of the fifth grade experiment, the small sample size does not loom so 
importantly. It is to be noted that the variabilities are not inordinately 
large. We are not satisfied with the sample size, but we are willing to 


consider this experiment to be representative, highly indicative, and 
capable of reproduction.‘ 


The J term in Table II is acknowledged to be but à "best estimate." 
But an examination of the variabil 


ities is enough to reassure one that 
the chances are very small that the population J is Zero. Added assur- 
ance may be obtained by an examination of the sixth grade results in 
Table ITI. 


post-test was given to all of 


TABLE II 


SUMMARY or FIFTH GRAD 


E RESULTS OF AN EXPERIMENT 
IN SPELLING, USING A THREE-GROUP DESIGN 


Experimental Control Control 
Group Group I Group II 
(N=10) (N=10) (N=10) 

Pre-test yes (P3) yes (P1) no 
Pre-test means 21—3.2 0.8 @=2.8+0.7 i=3.0 X 0.6 — 1(a4-a3) 
Training yes (Tı) no yes (T3) 
Post-test yes yes yes 
Post-test means 5i—9.9 11.6 b2=3.540.8 bs=11.241.2 
Improvement means di—6.7 10.9 4=0.7+0.5 d,—82 +? 
Value of I d — (d34-d;) = —2.2 


‘ This experiment was carried out with the aid of Mr. Richard Walk, Harvard Uni- 
versity. We are greatly indebted to Mi i å 
School, Newton, Massachusetts, for her helpful Cooperatio 
and Miss Cecilia Dunlop, teachers, for indis 


BALERS POSS UHR 
TABLE Ill 


SUMMARY OF SIXTH GRADE RESULTS OF AN EXPERIMENT 
IN SPELLING, USING A TunEE-GRouP DESIGN 


Experimental Conirol Control 

Group Group I Group II 

(N=8) (N=9) (N=8) 
Pre-test yes (P3) yes (P3) no 
Pre-test means a=54+14 2,604515 i=5.7 40.8 — 321-23) 
Training yes (T) no yes (T) 
Post-test yes yes yes 
Post-test means b, 211.8::1.0 b.=6.8+14 by =14.440.3 


Improvement means di 26.4 x12 d.=0.8+0.6 ds3=8.74? 


Value of I d,—(det+ds) = —34 


obtain a sizeable interaction term with negative sign. 
to believe that this is an accidental occurrence 
he fifth grade results are comparable. The in- 
term offers interesting possi- 
e-test somehow diminished 
One possibility is that the 


Again, We 
We have little reason 
when we consider that t 
terpretation of the nega 
bilities. We know that t 
the effectiveness of the trainin 


pre-test was emotionally distur 
ly on the pre-test may have 


is that the writing down of a word erroneous 
rrors. An examinati 


the experimental group 
andards against which to evaluate the occur- 


ad used the ordinary two-group 


design (the experimental group and control group pt 
of the teaching proce d have be 
The pre-test has been sh amma 
of the teaching method. ave revealed a y 

the epee vac two-grou som the method used to —— 
tiveness of traini d erate to cut down effects 

aining procedures $ 

of training as they are capable of being measured bY ordinary gen 
Using control group II, however, allows us to evaluate iae p e | 
the effects of training or the success thod. 1 


Plications here for educational researc 


Four-GROUP DESIGN 


hree-grouP design are 


Harvard University; 


Tur USE OF À . 
in progress In 


Od i i t : 
her experiments using the placing special 


the Laboratory of Social Relations, 
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emphasis on the problem of experimentally-induced attitude changes. 
Preliminary experiments have revealed that large interactions often 
appear in such experiments, thus demonstrating the need for control 
group II. In addition, the mathematical implications of the three- 
group method are being explored. The analysis of data presented above 
was the simplest possible for the purpose. It seems clear that more 
sophisticated statistical procedures, such as an adaptation of the analysis 
of covariance, may enhance the evaluation of research results. In 
particular, the mathematical nature of J, the interaction term, needs 
to be investigated. Its meaning is at present obscure, and the reason 
for this obscurity may be revealed by Table IV. 


TABLE IV 


SUMMARY OF MEAN SCORES AVAILABLE FOR THE ISOLATION OF 
INTERACTION ErrECTS BETWEEN PRE-TEST AND TRAINING 
IN A THREE-GROUP DESIGN 


Pre-Test 
yes no 
A yes Exper. Group Control II 
niinc no Control I ?? 


If this table above could be filled out logically with mean improve- 
ment scores it seems clear that a straight analysis of variance, with a 
solution for the interaction between pre-test and training, would be 
feasible. But a group with no training and no pre-test is not sensible at 
all, with reference to the laboratory experiments with which we have 
been dealing. The post-test score for such a group would be in effect a 
pre-test, and no improvement score could emerge. Thus, that J is a 
"true" interaction magnitude may be questioned for the case of care- 
fully controlled laboratory experimentation. 

However, in the case of field studies, such as those on attitude 
changes in the American soldier as a result of propaganda,’ there is an 
opportunity to fill in the empty cell in the four-fold table, provided 
that we use a fourth group, control group III. In the case of such field 
studies we may consider that passage of time is an important variable, 
since many uncontrolled events may occur. For example, we may be 
giving the American soldier propaganda about the great strength of our 
ally, Russia. Sometime between the pre-test and the post-test, Russia 


5 We are indebted to Professor S. A. Stouffer for this information and for pointing out 
the necessity of a four-group design. 


4 


yi 
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Germans. This is an uncontrolled 
taking place in the period of time 
This event is assumed to im- 
The design of the experiment 
such a field experiment is 


may lose an important battle to the 
event, which we shall designate as Ei 
between the pre-test and the post-test. 
pinge upon all members of all four groups. 
required to reveal interaction effects in 
shown in Table V. 


TABLE V 


NUMERICAL VALUES NEEDED FOR THE ANALYSIS OF INTERACTION 
N A Four-GROUP DESIGN 


EFFECTS I 

Experi- Control Control Control 

mental Group I Group II Group III 

Group 
Pre-test yes yes no no 

. ditas . mtt: 

Pre-test means 01 X moi az tma, Wer fee 
Training yes (T) no yes (Tı) no 
Event, E yes (Zi yes (E) yes (Ei) yes (E) 
Post-test yes yes yes yes 
Post-test means bi £Omby bz 0n ba £Ombs | by EO mby | 
Change means d; -bi-a da =b- 2 d3=b3 —* d, -b,—i 
Interaction d, — (da +d3 — di) 


e now infer two pre-test means based on 
for control groups Il and III. Any change 
me between the pre-test and post-test for 
effects of event Ey. Since all of 
the following statements may be 


It is to be noted that w 
statistical assumptions: those 
in score taking place in the tir 
group III would be attributed to the 


the groups are exposed to this event, the | : 
made: (1) dy=f(Es), o^ the change in attitude for group III is a func: 


tion of the outside, uncontrolled event, Ex; (2) dift T3), or the 


change in attitude for control group II is a function of the training anc 
he change in attitude for contro 


the event, Ey; (3) df (Ei Pa) oF t 
came tes ope ts of the pre-test and the event £i 


group I is a function of the effec 
change in attitude for the experimenta 


(4) di - f(E h 
= Pit Ti+), ort e 
group is due to the summative action of the pre-test, the event Es th 
training or education process interaction between the pre-tes 
and the succeeding events in time (£i and T). Admittedly, this analysi 
seems destined to be cumbersome under many pourri ra 
i ight the 1 
appears in order to uncover and numerically weig 
gere en fold table (Tab 


teraction phen We note now that our four 
omenon. e note r 0 | 
s e this design, by inserting contri 


IV) above can be filled out when we us 
group III into the lower right-hand cell. 
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In view of our argument above, is it still impossible to fill in the 
fourth cell of the four-fold table for the three-group design? True, we left 
that table with some question marks in the fourth cell; but we believe 
in the light of an analysis of the four-group design, that it may be pos- 
sible to do something about the missing cell. The most obvious step 
would be to consider the three-group design as indicative of measures în 
time. A group with no pre-test and no training could not vary with time 
insofar as our measures apply. We could therefore infer a zero improve- 
ment score for the fourth cell. This seems a fairly sound procedure in 
well-controlled laboratory experiments, but it is subject to these dan- 
gers: (1) an unknown event, E, may have affected our three groups, so 
that if we actually had a fourth group, a difference score would have 
appeared; (2) time passage, maturation, or growth processes may have 
made the obtained post-test score different from the “true” pre-test 
score. Actually, if these cases do occur, they require the four-group de- 
sign in order to analyze adequately the experimental interaction. This 
means that many laboratory experiments in learning, motivation, 
nutrition, and maturation, have been inadequately designed. This is 
especially true of cases where some pre-condition of the animal has been 
measured in order to give the experimental measure a baseline. For ex- 
ample, in the lever-pressing experiment, we often obtain a sample of 
lever-pressing activity in a pre-test, to get a baseline for unrewarded 
behavior in this class. But these animals all grow older, and this can 
be called event E. Thus, in order to assess adequately the improvement 
or learning scores, a four-group design seems to be appropriate. We 
are presenting this argument for serious consideration. Actually, at 
present we are not certain that such elaborate control group design is 
absolutely necessary; but we suspect that it is, especially in cases where 
accurate quantification and the discovery of constants is the aim of the 
experimenter. 

That the interaction term, I, may be of importance in evaluating 
research which falls within the three main areas we have considered 
above, seems evident. We would like to exhort psychologists who work 
in these fields to study further the uses of the three-group and four- 
group experimental designs and report the results for discussion. It 
seems possible that many previously unquantified effects—attentional, 
attitudinal, or perceptual, may be revealed and weighted by using the 
suggested plans. At least that is the hope we entertain on the basis of 
preliminary indications.* 


6 Since the time this manuscript was submitted, a noteworthy finding has been 
studied. We have preliminary evidence that the pre-test may operate to reduce post- 
test variance in studies of attitude change. This is evidence of another sort of inter- 
action. It is, in effect, the limiting of variance by addition of restrictionsof "attention. 
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SUMMARY 


NF. = p an extension or modification of currently-used 
ies mn esign. We have shown that the use of a three-group 
B pi e Dey pl a four-group design, has potentialities for re- 
ient weigl ting certain interaction effects. These interaction 

ects may shed light on attentional, attitudinal, and perceptual factors 
which often are important in these three types of psychological experi- 
mentation: (1) transfer of training; (2) induced changes in attitudes 
opinions, and values; and (3) the effects of controlled experience em 
responses, skills, or performances which already exist in the behavior 
repertoire. Further study of the extension in experimental design is 


recommended. 
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ANIMAL HYPNOSIS 


THELMA T. GILMAN AND F. L. MARCUSE 
Cornell Behavior Farm, Cornell University 


hat the induced immobilization of animals has 
scarcely been dealt with in English psychological literature and is vir- 
tually extinct as an area of experimental investigation, the problem has 
had a protracted German-French history. Volgyesi (50) claims that 
the first reference to immobilization was made in 1636 by Daniel 
Schwenter, a mathematics professor at the University of Altdorf. Most 
authors, however, date its inception as an investigated phenomenon at 
1646, with the appearance of the "Experimentum mirabile de imagin- 
atione gallinae Kircher” (50). Kircher indicated that, by holding a hen 
in an uncomfortable its head, a state of entrance- 


position and fixing i 
ment could be induced when a € ts beak out- 


halk line was drawn from 1 
ward. Astounded by the “miraculous” nature of the phenomenon, 


Kircher concluded that the state stemmed from the hen’s imagination. 
Despite the novelty of the discovery; enthusiasm for the solution of the 
problem failed to find additional fertile ground, and the problem re- 


mained in obscurity. 


IN THE NINETEE 


Despite the fact t 


NTH AND TWENTIETH CENTURIES 


DEVELOPMENT - 
Not until the middle of the nineteenth century did immobilization 
of interest and investigation. Based 


of animals again becom 
: i i f visual fixation, some 

on Faria’ otion of the hypnotic efficacy © al fi 
nie tet n AT to e king steadily into the m 
of their prey. The literature of the period reflects the erp o 
Mesmer's concept of “animal magnetism.” Like Mesmer E i qu 
i i material em- 
the writers perpetuated the notion that netic : 


anated from the magnetizer to the magn . 
In 1834, Wilson wrote of the effect of ‘passes’ of a magnetic nature 


made upon fish, cats, dogs, hens pigs, geese, 4 horse, and a 
Describing the effect of these "' " Bantam hen and cock, he 


Says: 

The cock at first chuckled, and made ise; then he id ee 
remained so; sometimes sat down 8 ^ neither S ved waa 
stood upright for a considerable time, 
nor foot; and when the cage Was taken away» 

me to touch and pull his ge nb and gills, and to $ 
ng the slightest movement of 


etized by means of ‘passes. 
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In 1854, Newman mentioned the mesmerizing of birds and mice by 
snakes, and the taming of horses in half an hour by means of magnetic 
"passes." Describing the interest in animals at this time, Newman adds: 
Animals have been frequently fascinated for purposes of experiment, and a 
universal rigidity of the muscles produced to such an extent as to cause them 
to resemble pieces of statuary, so that the animal could be taken up and its 


whole weight supported by one foot and this state produced and continued at 
pleasure (26, pp. 16-17). 


In 1881, Preyer (30) had credited Braid with the observation that 
a phenomenon similar to human hypnosis existed in animals in moments 
of extreme danger. This reaction Braid called "monoideismus," a com- 
bination of cataplexy, hypnosis, fascination, and all other phenomena 
which temporarily disturbed the nervous system because of the limita- 
tions imposed on it by a focused attention. Braid cited the attraction 
of the bird to the staring snake as an example of monodeismic activity 
based on a single, dominant, attention-compelling stimulus. Preyer, 
however, emphasized cataplexy (a fear reaction) and is generally 
credited as proposing the idea of “fear” as a possible explanation of the 
condition. 

With the existence of the phenomenon firmly established in many 
forms of animal life, the literature (38, 48) now dealt with two principal 
aspects of the problem: the characteristics of the trance, and the nature 
and function of the state itself. The predominant explanations of the 
function or nature of the state were: the instinct to simulate death for 
protective purposes, hypnotism, or the paralysis of fear (cataplexy). 

In the twentieth century the work of Holmes (17, 18, 19) Hoagland 
(14, 15) and, to a lesser extent, Mowrer (25) and Foley (10) have helped 
to make a mark on the American psychological scene. Although the 
investigation in the United States has continued to be sparse, several 
studies have been reported in which an immobile state has played a 
secondary role. Boas and Landauer (2) note, for example, that the best 
readings for cardiac rate in fowls were obtained by turning the animals 
on their backs. They would remain in this position for 20 to 30 minutes 
at a time, and sometimes for as long as two hours. Recordings were 
then made in these inactive states. Marcuse and Moore likewise note 
that their tantrum pigs could be ‘‘quieted” for conditioning studies. 
Describing the agitation of the pig in the Pavlovian frame, the authors 

write: 

It was possible to quiet the animal completely merely by scratching or clipping 
her flank with a pair of scissors. The quieting effect was almost instantaneous. 
The grunting and struggling ceased, her eyes closed, her breathing became more 
regular and her whole body relaxed, often to the point of hanging limply in the 


1 ee m 


id 
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straps. Cessation of the clipping immediately resulted in a full-scale resurgence 


of “tantrum” behavior (22, pP 236-237). 

Derim (43) notes, in a study of narcesis in frogs, that the narcotiza- 

i time is shortened by turning the subject on its back. Minami and 

D allenbach (23) made use of the immobilization technique to compare 
quiet” and active periods in @ study of retroactive inhibition in the 


: The designation of the phenomenon. In a historical survey of the 
literature one finds that the name of the condition has varied consider- 


ably. It has been referred to in the earlier literature as mesmerism (52), 
and fascination or en i 
feint, while others used the terms cataplexy (30), 
of death (4), and immobilization reflex (37)- Minami 
(23) called the condition akinesis, while Mangold, earlier, had called it 
hypnotic akinesis. The state has also been referred to as hypertonicity 
(30), tonic immobility (10 14, 15), myotonia congenita (20, 44,) in- 
hibitory state (23), oF simply, i 
By 1938, Stroder (43) 
cerned with the designation © 
of the more recent papers On this problem, cites 
tempts to distinguish between their possi 
HE TRANCE 


tten that over 100 works were con- 
Steiniger (40), in one 
14 such terms and at- 


Tue INDUCTION OF T 
Common denominators OF differential factors in the types of in- 
ducing stimuli have not been systematically dealt with, and one finds 
s to a variety of methods em- 


scattered through the literature reference 
menters. The earliest method, introduced 


ployed by the respective experi | lu 
by Kircher (24), consisted o holding the hen in an unusual position, 
with its head held down on à flat surface while a chalk line was drawn 
from its beak outwards. 
traneous factor. 

Carrington (5) 
fixation of the eye 
(9) considers the natu 
Hoagland (15) names stroking betwee 
or sudden dorsal position. Rabaud (31), on | 
immobility is neVer invoked by à sensory influence, 
shock or concussion- 

Foley (10) classified t 
categories: 

Repetitive stimuli. The use of this method i 


nduction: bodily posture, 
nters. Dearborn 


imulus to be tactile or visual. 


n the eyes 
the other hand, claims that 


but by a sudden 


lists three principles of i 


he methods into the following four principal 


nvolved the stroking of 
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body parts such as might have been used in mesmeric "passes." Mow- 
rer (25), for example, describes stroking the comb of the rooster until 
it became quiet. Bonnet and Saboul (3) mention scratching the area 
around a frog's eyes, pinching its nostrils, or closing its eyes. Repetitive 
methods have sometimes consisted of “gazing” into the eyes of the 
animal in the fashion initiated by Braid (50), or by the sudden presenta- 
tion of a light stimulus. Mowrer (25) mentioned that the sudden pre- 
sentation of strong photographic lights accidentally induced immobility 
in his rooster, an incident which first seems to have stimulated his in- 
terest in the problem. 

Another method which might be considered repetitive is the "swing" 
technique where the animal is generally held in the hands of the ex- 
perimenter and swung gently back and forth several times (50, 51). 
When the animal is set down, it is usually immobilized. Parker tells 
of suspending a sea anemone from a hook and enclosing it in a glass cage. 
The contracted condition can be easily induced by moving the glass vessel in 
which the sea anemone is suspended back and forth a little, whereby the sea 
anemone is made to swing and strike on the sides of the vessel. By thus stimu- 


lating the sea anemone from time to time, it can be kept in a condition of con- 
tinuous and extreme contraction (27, p. 51). 


Marcuse and Moore (22) also made use of a repetitive technique via 
continuous “‘stroking”’ or scratching at the hide of a pig. 

Pressure on body parts. Some investigators felt that exerting pressure 
on a particular part of the body, the precise part differing with the spe- 
cies, would induce the state (31). Minami and Dallenbach (23) induced 
"rest" in their cockroaches by pressing a finger lightly against the 
abdominal area of the inverted subject. 

Inversion. Other investigators have utilized the technique of sudden 
inversion. Boas and Landauer (2) mention turning the hens on their 
backs to keep them quiet during cardiac recordings. Rode (37) turned 
owls on their backs to induce the immobility, while Reisinger (34) 
achieved the condition in this way with Indian locusts. Stroder (43) 
applied this method to frogs. 

Restraint from movement. Although Foley cites this as only one of 
several methods, some degree of restraint seems to have been present 
in all the investigations. Marcuse and Moore (22) had their pigs re- 
stricted in a Pavlovian frame. Mowrer (25) held the rooster's legs, while 
stroking his comb. Reisinger (34), working with crayfish, held the head 
and claws motionless, or restricted movement by holding the tail. 
Pavlov (28) restrained his dogs by the straps of the conditioning ap- 

paratus employed. Dearborn (9) restricted his animals from movement 
until efforts to move ceased. Ten Cate (46) held the octopus in his 
hand to prevent its moving. It is not difficult to infer from the work 
of Bonnet and Saboul (3), that to scratch or stroke any part of the frog; 
the animal must have been held down, thus restricting spontaneous 
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m beri 
ovement. Minami and Dallenbach (23), by exerting pressure on the 


insects, i icti 
, in effect, produced restriction of movement. Gilman, Marcuse 
ng movement of the 


and M i EH 1 

en er (12) immobilized chickens by restraini 

ce = p Inverting an animal, or holding it in an unusual pos- 
, would also produce restriction of movement. Similarly cases 


where ''vi inn" TER 
kary visual fixation” was thought to be effective are to be under- 
in terms of the physical restriction needed to permit “gazing.” 
hen he describes Volgyesi’s 


minaga (42) cited an example of this w 
se of the fixation method to immobilize a lion. Volgyesi sat on the 
the animal's head 


back of the lion, placing his hands on both sides of 
heir eyes could meet. Moll's reference to Kir- 


chers ridi ; : ; 3 7 
md s original experiment gives a more detailed and informative de- 
ription of the chalk-line, visual-fixation method. 


S d is held down on the ground; the head in particular is pressed down. A 
alk-line is then draw from the hen's beak. The hen 


wi : $ : igs 
ill remain motionless. he hen's imagination; he said 


it imagined it was fastened, tly did not try to move. Mach 
mentions Kircher's experiment. » + xample of the ease with which an 
experiment may be € ime the chalk-line was 
held to be the essential par er, it was shown that 
ie experiment could be successfully per holding the hen 
DEDE to the ground. ..* This much 

otionless in any position in which they have been 
+». in general, the stimulus used with the animals is tac 


Rijlant, describing his use of Kircher's m 


N^ : 
L'animal est imobilisé sur une table et une ligne 
tre en état d'hypnose € 


crest ses yeux. ++ l'animal en 
es heures. (35, P- 417). 


tile, .. . (24, p. 200) 


ethod, says: 
droite est tracée 4 la craie 
t s'y maintient pendant 


CHARACTERISTICS or THE TRANCE 
Some observations of the appearance of 


Status of the musculature. 
E ibe i cle contraction 4, 6, 15, 


the phenomenon descr 


21, 30), or ici 

, " atonicity (1, 45, 
beli that hy- 
elieve tha y 


cerned with the problem of muscle condition seem to 
eristic of the condition. Foley says that ^... 8 
he skeletal musculature 1$ characteristic 


co is charact 
oft degree of plastic tonus o 
int onic immobility" (10, p. 515)- Dearborn 
mind ...state of extrem 
clai p of the muscle” (9, p. 4 
du ms, that both a relaxed and cont 
of ring the course of a trance. Rode (37) observe 
his owl contracted, and then relaxed slowly: : 
is Heart rate. Although one finds some references to cardiac changes 
Gare immobilized state, little of the material is quantitative. Preyer 
) and Verworn (49) observed that heart rate increased. Sidis (39), 


observed that crayfish fall 
:ae often almost to a 

7). But it is also possible, as Rijlant (35) 
ndition may occur 


racted muscle co 
d that the entire body 
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however, claimed his frogs showed retarded cardiac rate. Beritoff (1), 
working with several vertebrates, also noted cardiac decrease, as did 
Marcuse and Moore (22), working with pigs. Rijlant (36), found no 
change in the heart rate of an immobilized rabbit. Gilman, Marcuse 
and Moore (12) working with chickens found no significant difference 
in the heart rate of susceptible animals during trance, nonsusceptible, 
and control animals. 

Respiration. Prayer (30) referred to noisy respiration, while Verworn 
(49) and Coriat (6) observed accelerated respiration. Rode wrote: 
Dès que la Chevéche est “endormie” le fait le plus frappant est la modification du 
rhythme respiratoire qui s'accélére, alors que la cage thoracique se dilate beau- 
coup plus que pendant la respiration normale (37, p. 452). 


Parker (27), in his study of the sea amenome, observed that the excre- 
tion of carbon dioxide was greater in the process of contraction than in 
the process of relaxing; however, no significant differences in output 
were observed during the contracted, as compared to the relaxed, state. 
Sidis (39), Piéron (29) and Beritoff (1) claim that respiration decreased, 
while Ten Cate mentions that respiration was unaffected (46). 

Analgesia. Although the majority of experimenters believe analgesia 
to be present in the immobilized animal, there is no unanimity of opin- 
ion. Holmes (18) and Darwin (8) both describe the absence of pain- 
reaction in immobilized insects, while Stroder (43) mentions analgesia 
in immobilized frogs under surgery. Ten Cate (45) observed surgical 
analgesia in angelfish, Marcuse and Moore say: “On one occasion the 
experimenter inadvertently cut the animal with the scissors, drawing 
blood, but elicited no reaction” (22, p. 237). Coriat, however, says that 
“Tn neither the frog nor the guinea pig were there any signs of cutaneous 
anesthesia during the cataleptic states" (6, p. 344). 

Steiniger (40), in accordance with his concept of unimpaired sensa- 
tion during immobilization, believes that pain is felt by the immobilized 
animal but is not overtly manifested because of its inability to move. 

General reactivity. ln general, animals are said to be insensitive to 
stimulation, although diversity of opinion exists. Holmes (17), describ- 
ing Faria's beetle experiments, points out that the presence or absence 
of moving objects or sound did not affect immobility. In his own experi- 
ments with water scorpions, Holmes (18) found his subjects responsive 
to light, contact, moisture and sudden changes in temperature. Rabaud 
(32) implies decreased sensitivity when he says that contracted muscle 
(the immobilized state producing tonic contraction) is less excitable. 
Beritoff (1) claims thereisa decrease in reactivity; however, he qualifies 
this by pointing out that the immobile state is composed of two phases; 
active and inactive, and only in the first, active phase, is decreased 
reactivity found. Stroder (43) writes that all reactions are absent during 


immobilization. 


Bonnet and Saboul (3) found increased electrical stimulation neces 
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sary to elici : 
qud — from a frog. Foley (10) cites the “lack of overt 
cee, att = € stimulation" as another characteristic of the 
cie em dad le vm the condition, calls it "negative or quiescent 
Einen bs en the presence of disturbing stimulation." Hoagland 
pompes i eee pd to environmental stimuli as a characteristic 
meets in vs rough he adds that in lizards “the mere movement of 
re aa environment was often sufficient to bring about a pre- 
recovery” (15, P- 718). Rode (37) observed that, though the 


ve pps of auditory or visual stimuli was ineffective, the state 
bed eo terminated in the owl by touching its extremities. 
Mica. ad ( 3), working with crayfish, claimed that light or darkness, 
E ped were inconsequential in their effects on immobilized 
Ed > ramwell (4), Verworn (49), and Steiniger (40) believe that 

ation of the stimulus inan immobilized state is not diminished, even 


t a RR, P 
so reactivity is inhibited.’ 
epth of trance. In relation to the study of reactive behavior the 


elgg of the depth of the trance has received scant attention. At 
st, Beritoff (1) merely indicates that the condition is not always at 


t s t ; 
ub same level from the moment of inception to the moment of cessation. 
is opinion, however, is :mental support, while other 


investi s ; 
vestigators, apparently iform in depth at all 
ontrary opinion. 


Suscebtibili evidence to support the c [ 
Nom The problem © nst non-Sus- 
(14) ew jd not received Very extensiv ntion. 
caract abaud (32) believe that tonic immobility 1s of an all-or-none 
i se E Coriat mentions that suscep 
anim » Sometimes à given experiment wo e 
iiem al, on other occasions the experiment would be a failure with © 
ET animal" (6, p. 344). Darwin (8); writing of prolonged duration in 
site € species, mentions that other varieties will not “feign twice in 
Succession. Holmes (17) mentioned the app f nonsusceptibility 
nt times. McGonigal (21) ivi 


earance O 
mentions 
Dearborn observe 


individual 


i 

ee at different > 

ac e in susceptibility among the insect species. m 

that: “Susceptibility seems to increase distinctly with repetition of the 
very erratic” (9 P- 424). He, 


im : i 
ti Dum on the average» but this 1$ 
, Notes, variation in reaction from su 

ecific factors 


on Several investigators have observed the in 

Susceptibility. Holmes (18 i f the trance 

arification. Subjects may react, in the eyes of 
rceptible movements. On the other 


an >; 

ma eh because the clearly Pe à c unnoticed 

eye bli e subjects may make movements 5° m | that they are x ^ go e 

consid inking, minute attempts at limb flexion, zation) or € n o 4 servel Ta 
EUM only partial movements, and thus may or be considere a reaction. 
Stor reactions, then, may be total (spontaneous rise), r limb movements) 


minim, 
al, or completely lacking. 


1 
The concept of “reactivity” needs cl: 
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when the subjects were under water (the natural habitat of water scor- 
pions) was very difficult. Hoagland (13) claimed that amyl nitrate 
prohibited the appearance of the state. 

Some investigators have attempted to determine the relationship of 
the mechanisms of balance to the induction of the trance condition 
(15, 50). Extirpation of the vestibular centers, however, produced no 
difference in the susceptibility of animals to immobilization. Still others 
have attempted to find a relationship between cerebral ablations and 
the immobility reaction. Tonkikh (47), working with frogs, found no 
change when the hemispheres were removed. Verworn, describing the 
tonic contraction occurring in the state, says: "Dieser charakterische 
Symptomen-Complex tritt auch noch ein, wenn man den Thieren das 
Grosshirn herausgenommen hat" (49, p. 154). The possible effect of 
taming had received no attention until the work of Steiniger (40). He 
found that birds confined to cages in a laboratory were less suceptible 
than birds allowed to roam freely. Comparisons of free and "tame" 
animals were made on the basis of the tendency to spontaneous im- 
mobility, induced by the presence of human beings. This difference, he 
explains, is due to diminution of fear in the “tame” animals by virtue 
of their relatively greater familiarity with man. Gilman, Marcuse and 
Moore (12) found that “taming,” Ze. experience with handling by 
humans, significantly decreases susceptibility in chickens. 

Duration. Piéron, studying frogs, said: ' . . . on assure son immobil- 
ité pendant un long temps qui... peut devenir de plus en plus long 

... qu'on répète l'expérience" (29, p. 230). Moll (24), who also studied 
frogs, claimed that duration increased with repetition of the experience. 

Not all writers, however, believed that duration increased with 
repetition. Rabaud wrote: “Il ne me parait pas prouvé que cette durée 
s'acroisse ni qu'elle diminue avec les répétitions" (31, pp. 75-76). In 
Holmes' earliest paper (16), he indicates the diminution of trance dura- 
tion with repetition, and, ten years later (18), wrote that all the forms 
studied showed a gradually decreasing duration as successive "feints" 
were induced. McGonigal (21) stated that graphs of duration all 
showed a sharp peak at the beginning, and then declined slowly. 
Mowrer (25) also noted decreased duration. Hoagland states: “The 
duration of the periods of immobility vary considerably from species to 
species, and to a lesser degree with individuals within the species" (15, 
p. 427). He describes the character of variation of duration as varying 

rhythmically in most cases. So strongly did he feel the rhythm to be ap- 
parent that he explained deviations from the rhythm as predictive o 
death. 

Other writers merely note the duration periods in their respective 
subjects and mention the variability of the condition from time to time 
in a single subject, or between individual subjects. Rode (37) wrote 0 
his owl that duration was never less than 15 minutes, and often better 


E 
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a agis ow , Volgyesi (50) also found repeated, prolonged duration 
1 e owl. Minami and Dallenbach (23) write that the duration of 
inactivity in cockroaches is not long and varies considerably. 
" Several investigators have described the effect of particular factors 
uration. Holmes (18) believed that the use of a dim light produced 
a longer “feint,” while the trance could be shortened by the appearance 
of a bright light. Fabre, according to Holmes (17), found that extremes 
of heat or cold prolonged the trance in beetles. Reisinger (33), on the 
other hand, claimed that light and electricity did not affect the condition 
at all. Increased duration was obtained by Hoagland (13) with the 
administration of ergotamine and adrenalin. Rode (37) claimed that the 
state could be instantly terminated by touching the extremities of the 
owl on which he made his observations, but added that such provoked 
awakening was followed by a few seconds of torpor, particularly after 
a long immobilization. Dearborn (9) wrote that sensory stimulation 
precipitated the cessation of the trance. Removal of portions of the 


brains of water scorpions Was said to decrease duration (17). 
ttempted to explain the decrease in duration as 
A “ae 


a function of fatigue: Holmes sa! ..one would expect to find a 
diminution of the duration, and perfection of the response as 2 simple 
12). Rabaud, who disputed increase 

‘tion, made some allowance for fatigue when he 
+7 à la longue un phéno- 


admitted: “Peut-être, Cepen ant, intervient-il à ] 161 
mine de fatigue tel que i ;ennent plus à immobiliser 


ns ne parvie 
l'animal" (31, p. 76). But the numbe tions are not cited in the 


ber of repeti 
data, nor is one informed whether repetitions are immediately successive 
or extend over à period of days. ilman, Marcuse and Moore (12) 
report a decrease in trance immobilization trials extend 
$ of trials are given ina 


over a 21 day period- When the same j 
one day period no such decrease is found. These results are said to argue 
i rms of fatigue. 


against an explanation in te . : 
Phylogenetic distribution ^s survey (50) of the history of 
is," he menti iently varied group of anima 

etic spread of the phenomenon. 


dias s 
animal hypnosis, 
subjects to indicate the genera ead € p 
' the ity of becoming immobile under the 

tis characteristic of al 


‘ren also states ` ** 

influence of Isor: restriction E Gor 
species and om (11, x 158). iterature deals with the application 
of immobilization techniqu irds (2, 12, 41) 50, 51) reptiles such as 
the lizard (14), amphibians (crabs. (33), frog (54), octopus (46), insects 
such as water scorpions (16), cockroach (23), and a variety of mammals 
that includes the pig (22), monkey (10), dog (28), lion (50), and man 
(53). 

_ Some writers were 
in the insect group (7; 


... the lower in the phylogenetic 


of the opinion that the state reached its maximum 


17). Coriat felt that 
state was the anim 


al experimented upon, the 
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easier it became to produce a motionless state. Furthermore, the duration ol 
the state in the crustacean was longer than in those vertebrates which had a 
fairly differentiated nervous system. (6, pp. 344—345). 


If, however, duration of the state is the criterion, such a belief is far 
from tenable since extremely long periods of duration appear in many 
groups higher in the phylogenic scale. It would be safest to rest with 
the knowledge of the rather consistent appearance of the condition all 
along the biological scale. 

It is of interest to note that an analogous condition has occasionally 
been observed in the human being. The condition known as cataplexy 
takes the form of cataleptic fits which are elicitable by sudden emotion 
(fear, laughter). Wilson, presenting some illustrative case histories, 
concludes that it may be the perpetuation of “... a biologically old 
automatism, a defense-reaction in which immobility is of advantage to 
the frightened animal" (53, p. 51). 


Hoagland makes another observation on analogous behavior in 
humans. He writes: 


I have been able to produce the condition in adult human beings. If one bends 


forward from the waist through an angle of 90 degrees... and...is vio- 
lently thrown backwards... the skeletal muscles contract vigorously and a 
state of pronounced immobility . . . may result (14, p. 716). 


"EXPLANATIONS" OF THE PHENOMENON 


Reflex response and cerebral inhibition. Some writers have attempted 
to explain the state by dealing with what was believed to be the neural 
mechanism producing it. Rabaud says it is ‘... une proprieté fon- 
damentale du systeme nerveux et dérive, comme telle, de la constitution 
des organismes considerés; elle en est l'une des résultantes et ne saurait 
avoir d'autre raison d'étre" (32, p. 141). He goes on to explain that 
the alternating mobilization-immobilization impulses of the nervous 
system produce motion and rest, respectively, and thus the condition 
would arise. He sums up its biological significance by saying: ‘‘. . . Pim- 
mobilisation réflexe est un cas particuler, une simple exagération du 
tonus musculaire normal” (32, p. 128). 

Bonnet and Saboul (3) attribute the condition to a depression of 
reflexes caused by the modified function of the medullary centers- 
Verworn described the mechanism as an irritability of the cerebrum 
and inhibition of the motor areas of the brain. Introducing the idea © 
the ''Lagereflex," he wrote: "'Sie ist nichts anders als die Hemmung von 
spontanen Bewegungen oder Handlungen... wenn gleichzeitig ein 
starker Sinneseindruck uns intensiv erregt. .. . Das Wesentliche der 
Erscheinung ist vielmehr der tonische Lagereflex" (49, p. 54). 

Coriat speaks of “cerebral inhibition,” but adds, “I cannot agree 


| 


A 
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ira the hypothesis that cerebral inhibition is a reduction of psychic 
i sity. This inhibition of the brain probably took place through the 
nesthetic stimuli of the manipulations being poured into the central 
nervous system. . - « " (6, p. 346.) 
P Rijlant (35) believed the condition to be a result of the modified 

ate of the central nervous system which produced a state of muscle 
contraction. Pavlov (28) and Frolov (11) used neural inhibition as 


their explanation for the phenomenon. Ten Cate (45) emphasized 
as the causative agent. Holmes (17), although 


' rejects the explanation implicit in the 
ty to be some kind of "thigmotatic re- 
(49) of ““position-correction” is likewise 


permanent excitation 
using the term "death feint,’ 
term, and considers immobili 
sponse.” Verworn’s concept 
reflexive. 

Hoagland postulates the 
the central nervous system: 


existence of inhibitory hormones acting on 
“These hormones are assumed to produce 


tonic immobility, when present above threshold concentrations, by in- 
hibiting impulses from 'higher nervous centers’ and by allowing im- 
pulses to pass from the tonic centers to the muscles” (13, p. 842). 
Death feigning. The theory of death feigning, or the simulation of 
death, posits the idea that the animal, in the face of danger, instinctively 
succumbs to immobility as à protective guise. Darwin (8) was one of 
the first to speak of this idea, although he notes that “fainting” with 
the paralyzing effects of fear may be confused with the instinctive 
death-feint reaction. Some doubt may have entered his thinking, for 
he noted, in a comparison of dead and “feigning” insects, that the im- 
mobilized insects had not adopted the characteristic death posture. 


Bra 
mwell wrote: 
ous simulation of 


I am inclined to think that in many instances it is a consci 
death, adopted by the animals from the instinctive knowledge of the fact that 
f extreme hunger, will 


ey, except under pressure 0 


death, or catalepsy, was only shammed . .. 


certain birds and beasts of pr 

not attack what is dead. 

(4, p. 156). 

ee can be raised as to the 
eing within the intellectual rang 
, McGonigal raises à question o 

feint. 


possibility of such voluntary behavior 


e of subhuman species. 
n the teleological value of the death 


omenon is that, assuming with Darwin 


The in : " " 
tere ;hole phen 1 
sting point of this w = » pet aep d ee rer aney 

i is case teleology has out- 


t M 
ul process was develope pem 
ies (a presumption not 25 yet superseded), ol 
saie itself, for a passionate Or excitable individua! o Lowe y and 
RA limbs so rigid that he falls on his back. .«- (in) the power of any foe.-- 
mparatively helpless and exposed (21, P- 408). 
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This criticism would seem to be an important one, for just as one would 
posit an instinct to feign death, one would have to assume that animals 
of prey had an instinct to avoid death-feigning, or dead, animals. : 

Sleep. Several people have adopted the theory that the condition 1$ 
sleep, or some form ofit. Heubel and Wundt (24), Sidis (39) and Pavlov 
(28) support this hypothesis. Heubel and Wundt considered the resting 
conditions (sleep and immobility) to be the consequence of the cess" 
tion of external stimulation. Sidis, however, considered this a “hyp- 
noidal” or sub-waking condition and not true sleep. 

Hypnosis. Although the term “hypnosis” has been used by some 
investigators, it has customarily been used with the qualification, tso- 
called.” Erhard (30) has supported this theory, but the most vigorous 
exponent of hypnosis is Volgyesi (50) who criticized sharply those who 
used the term, ‘‘so-called animal hypnosis." He strongly believed the 
condition to be the result of hypnotic influence. Bramwell, however: 
had earlier indicated his opposition to the hypnosis theory, writing: 
The more one appreciates the complexities of the hypnotic state—the fact that 
itis essentially distinguished by an increased power of controlling the organism: 
without diminished consciousness Or volition—the more one must recognize 
that it finds no analogy in an animal deprived of its cerebral hemispheres, an 
discharging a continuous stream of nervous motor impulses from its lower 
centres (4, p. 157). 

Spatial disorientation. Verworn (49), Rabaud (31), and Hoagland 
(14) have attempted to describe immobility as primarily related to a? 
impairment of the customary spatial relationship of an animal and its 
environment. 

Paralysis of fear. The principle idea presented in this hypothesis is 
that of a paralysis produced by fear. Preyer (30) first emphasized the 
fear factor by speaking of “cataplexy.” McGonigal (21) implies the 
presence of fear, when he points out that Holmes’ successful attempt? 
at immobilizing water scorpions might not have occurred if the subject? 
had been permitted to remain in their natural habitat while being im” 
mobilized. Rujka (38) extends the fear explanation further by consider 
ing the state to be a form of anxiety reaction. Steiniger (40) believe? 
the state of inhibited reaction to be attributable to two factors: the 
psychological factor of fear which induces the condition, and the i^ 

stinctive reflexive basis for the phenomenon. Its ecological importance 
he says may have two explanations. Vertebrates in this condition ca” 
in an instance of approach or pursuit by an enemy, be overlooked bY 
= T oe ant pin the wart of thé immobilized animal- 
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Gilman, Marcuse and Moore (12) empha: 
a factor determining not only suscep 
They suggest 
reception are interactive in the sen 


proprioception may evoke fearful behavi 
he experimentall 


trance in chickens. 


be operative in eliciting t 


size the importance of fear as 
tibility but also duration of 


that fear and disrupted stimulus- 
se that interference with normal 


or and thus both factors may 
y induced immobile state. 


SUMMARY 


It has been found that under cert: 
be induced in a wide variety of animals. 


acteristics of this condition have, as a res 
ir conclusions. 


dition have be 


to some degree t 
in 1646, the principal hypotheses have 


nique, been discrepant in the 
deliberately inducing the con 
ature, but all seem to involve, 
movement. Since its description 


ain conditions à à 


‘trance’’ state can 
Attempts to picture the char- 
ult of inconsistencies in tech- 

A variety of methods for 
en referred to in the liter- 
he factor of restraint of 


1 disorientation and paralysis of 


been death feigning, hypnosis, spatia 
fear. 
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Evans, R. M. Introduction to color. New York: Wiley, 1948. Pp. xd 
327. 


Once in a blue moon a book appears which brings together many 
important developments in a field at a time when integration is im- 
peratively needed. The subject of color embraces a vast domain, rang- 
ing from physics to psychology on the scientific side and from industry 
to art on the applied side. The author of this book, who is Head of the 
Color Control Department of Eastman Kodak Company, is well 
equipped for his task with a background of training in optics and 
photography, with industrial experience in still and motion picture 
photography, and with an active interest in what the psychologist has 
to offer in unravelling the riddles of color. Asa result this book deals not 
only with basic physical considerations but also includes a good deal o 
phenomenological material. Applications to color photography, art, 
and design are not overlooked. Two of the most difficult aspects of the 
subject for the lay reader, mathematics and a highly technical nomen- 
clature, are adroitly side-stepped without any sacrifice in clarity and 
accuracy of exposition. Allin all this is a notable contribution to visua 
science. 

It is not possible within the confines of this review to give an ade- 
quate picture of the careful, detailed discussion of fundamentals and the 
vast amount of information this work contains. Anyone working Ser" 
ously in the field of color today must understand how visual stimuli are 
measured and specified, either in terms of the I.C.I. standard observer 
or by means of an equivalent system; he must be aware of and on guar 
against errors inherent in photometric, spectrophotometric, and color- 
imetric measurements, especially with respect to the newer fluorescent 
materials coming into use; and he must have some knowledge of the 
vagaries of light, paint, and pigment mixtures. Certain phenomena, 
like dichroism and metamerism, if not properly understood, can be the 
source of faulty inferences regarding both the nature of the stimulus an 
the behavior of the eye. These and many other matters concerning light 
sources and reflecting, transmitting, and mixture characteristics of solid» 
liquid, and colorant substances are fully discussed and illustrated bY 
means of excellent diagrams and superb colored plates. : 

The power of a quantitative treatment of color comes out particu- 
larly in the many uses to which the I.C.I. mixture diagram can be put. 
Not only does it provide a means of specifying dominant wave-length 
and purity of stimuli and the results of mixture of any number of lights, 
put it also shows changes in chromaticness of objects in different illum- 
inants and of mixtures of dyes, paints, or pigments in various amounts 
or concentrations. Maximum purity theoretically obtainable from suf- 
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=n m arro are thereby easily and accurately diagrammed. 
ee j ions this method of specification of stimuli should be 
ode: known and used in psychology. 
ibn cn with detailed handling of the physical conditions under which 
isis ess seen will also be found a wealth of phenomenological observa- 
Eee imately related to physical and physiological antecedents. The 
i ssion is not limited to color in the narrower sense, for matters like 
perception of depth, the texture of surfaces, and the nature of shad- 
ows are explained by reference to position and size of light source, the 
reflecting properties of objects, and the like. It is in this area, the meet-- 
ing place of psychology and physics, th rer believes psychol- 


x at this review 
ogists will find the book of greatest interest and value even though the 
author’s concept of psychology does not seem W 


tholly satisfactory. Let 
us turn to this question since it is basic to most o 


k f the work being done 
in the field of color whether by psychologists or others. 

The key to Evans’ treatment of color as most psychologists under- 
stand the concept is found in the position adopted in the 1943-44 Re- 
port of the Colorimetry Committee of the Optical Society of America 
divided into three parts: (1) physics, which deals 
fiers of light energy; (2) psychophysics, which 
f radiant energy by the eye under a single, 
specified set of conditions, viz., matching small foveal stimuli against a 
completely dark background; and (3) psychology, which deals with 
factors like the modes of appearance of Katz, so-called perceptual at- 
tributes of vision like texture, Size: tre and glossiness, 
and other factors like attention, feelings, attitudes, and intentions as 
they affect colors. Psychology is regarded as having to do only with sub- 
jective, qualitative matter tion will come a 
Who believe in and work in psychology a$ an experime 
Science. . 
old division of color seems a5 if it might offer 

hich beset workers in the 
ut as the realm of physics is left behind and the 
Organism enters more an :nto the total process, the split between 
Psychophysics and psycholog d confusing. The men- 
tal keeps popping into the exposition, upsetting the scientific applecart 
again and again. At times the aut i re of the validity 
of his fundamental position, 8-8- when he says: "This difference (in the 
Way of looking at the stimulus) suggests that what has been described 


as a difference in attitude may, 1n the last analysis, be found to be a 
difference primarily in the way the eyes are used rather than in the 
mental phase. A consequent decision that the effect is purely psycho- 
Physical may be involved" (p. 166). To the psychologist itis obvious 
hat the definition of psychophysics adopted bY the Colorimetry Com- 
Ittee is too narrow and leads to insuperable difficulties for any con- 
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sideration of color based on it. Since the time of Fechner a number of 
psychophysical methods other than the null method of the standard 
observer have been found to yield statistically reliable results in studies 
of vision, hearing, kinaesthesis, taste, and smell. That some of the 
difficult problems of colorimetry, e.g., the problem of color differences, 
and so-called color appearances, will not be solved by taking results 
from the method of just noticeable differences or the null method of 
gauging the spectrum, and forcing them to yield information about 
equal sense distances or equal ratios, must be clear to all by now. 

definition of psychophysics that includes the procedures made possible 
by the work of such psychologists as Urban, S. S. Stevens, Guilford, and 
certain contributions of Thurstone and others might prove very useful 
in approaching color problems now relegated to the psyche. E 

On the other hand, due weight is given in this book to factors like 
adaptation level and contrast in explaining color phenomena which 
many psychologists have been content to label “constancy” and let go 
at that. Part of the reason for abandoning these concepts when dealing 
with phenomena of constancy and color conversion may be due to the 
fact they were conceptualized too simply. As Evans points out, there 

are at least three types of adaptation: general, local, and lateral (with 
temporal effects to complicate each, it may be added); and three types 
of contrast: subject, object, and illumination contrast. Whether or not 
these types of adaptation and contrast are merely descriptive or are 
fundamentally different is still an open question. 

Many psychologists will miss consideration of color theories and a 
good deal of information which physiology supplies in connection wit? 
this topic. Evans seems to find the tri-receptor theory plus a brightness 
receptor and a new concept, “‘cross talk” among nerve fibers, adequate 
for explaining a number of crucial phenomena. Until an alternative 
theory to the Young-Helmholtz theory comes along which is equally 
simple conceptually and as convenient for quantitative work we may 
expect people to go on using the older formulation in spite of its know” 
shortcomings. 

To sum up: this book constitutes a very important contribution t9 
visual science and will exert considerable influence for a number 9 
reasons. First, it makes available a great deal of material scattered 1” 
scientific, technical, and trade publications. Second, it relates a large 
number of phenomenological data to their antecedent physical conditi 
tions. Third, it opens up many possibilities for further investigation t° 
the discerning reader. That the psychologist can play an important n 
in the future of visual science no one can doubt after reading this books 
because it shows that many of the problems are predominantly psycho: 
logical in character. Progress will come through the application E 
quantitative methods to color phenomena now relegated to mere ie @ 
pearance,” or to subjective factors like intentions or illusions. Ev 
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has brought visual science fart 
deal that used to be thought t 
to objective treatment. 
we may regard this book as a contribution by 
psychological slant, then we may hazard the op 
for another book by a psyc 


Systematic attempts to apP 


b. cupide of personality which compris 
Which mediate between S an 
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her along the way by showing that a good 
o be "purely psychological" is amenable 
Much still remains to be done, however. If 
a physicist with a strong 
inion that there is room 


hologist with a strong physical slant! 
Harry HELSON. 


Bryn Mawr College. 
Doos, LEONARD W. Public opinion and propaganda. New York: Holt, 
1948, $3.75. 


Teachers of courses in the psychology of public opinion and propa- 


If Doob's new text does not fulfill 


ganda surely need a good textbook. 
theless—and by far—the best text to be 


this need completely, it is none 
had. 
A word first about its poin d organization. The author 
takes as his systematic theme, opening chapters, the 
1. With varying suc- 


behavior theory of Clark Hull and the Yale Schoo f ryi 
cess, the chapters which follow attempt to apply Hullian principles to 
an understanding of attitude formation and maintenance, the develop- 
ment of public opinion, the princip paganda, and the myriad 
of behaviors which constitute mass communication. There are twenty- 
one chapters covering such topics a$ the cultural background of public 
opinion, sampling of opinion, the techniques of polling and intensive 
surveying, the nature of propaganda, content analysis, the perception of 
propaganda, the relation of learning and personality to propaganda 
reaction, and a triad on press radio, and the media of sight and 


sound.” 
Evaluation of Doob's 
place, it is more than à te 


book must proceed on two levels. In the first 
xtbook, representing as it does one of the few 
ly a consistent theory of behavior to the 
vast realms of opinion and communication. And then, it 1s also a text- 
book. : 
Consider Doob's book first a$ 
the opening chapter, 
Propaganda, we deal 
principle :ndividual behavior- y cult 
society an pii m i but in terms of their impact 
on individual behavior. 
of individual behavior, D 
Principles: the S-R relationship; € ep : 
the law of effect stated in terms of t -reduction; the interve 


t at systematization. In 


he study of opinion an 
but rather with the 


an attemp 
in t 


The necessary equ i 

in terms of these basic 
tension state; 
ning 


d R; the concept 
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shortly; the Hullian doctrine of habit and habit organization; and finally 
knowledge, variously denoted as know-how or skills or subjective, c08- 
nitive context. Attitude, obviously the central term, is defined in 
simple terms as "the socially significant internal response that people 
habitually make to stimuli." “Public opinion refers to people's attitudes 
on an issue when they are members of the same social group.” That, 1n 
essence, is the conceptual core of Doob's approach, and to it he sticks 
with striking tenacity through chapters ranging all the way from the 
learning of propaganda to the printed media. 

In Chapter 5, a crucial one dealing with the “behavior of public 
opinion,” we see a rather crucial test of the adequacy of 190 proof Hull 
as applied to public opinion. The result is curious. One can go only $9 
far, in the kind of theorizing which accounts for the development 0 
internal responses (attitudes) in terms of learned drive-reducing be- 
haviors, of successful propaganda as drive-reducing stimuli, and so 0”- 
When Doob comes down to an attempt to state some laws about public 
opinion, however, we discover that he too is strapped, for, lo and behold, 
there appears before us a description of opinion formation mechanisms 
in terms of the classical mechanisms of ego-defense: rationalization, 
displacement, compensation, projection, etc. The chapter concludes 
with some stunningly thought-provoking general laws of public opinion 
which attempt to relate the formation and expression of opinion to such 
phenomena as frustration, anxiety, and the need for conformity. For 
the reviewer, the chapter appeared as the theoretical high point of the 
book, the one point at which he felt liberated from the interminable 
circle of things working because a habit existed, because a stimulus ha! 
reinforcing value, or because a bit of behavior brought about a reduction 
in drive state directly or indirectly. The theoretical model, rigorous 
though it may be, and useful though it has proved in certain restrict 
forms of behavior analysis, is a woefully impoverished one when place 
alongside the real thing of public opinion and propaganda. While 
there are some conceptually provocative sections, one does not feel that 
Doob has succeeded in doing theoretical justice to the richness of the 
material which he treats on the descriptive level. For at this level, Leon” 
ard Doob is a knowledgeable, shrewd, and amusing observer who sees 
and understands much. 

We must talk now about Doob’s work as a textbook. It is a good 
textbook, and it is packed with a great deal of recent material on pro?" 
lems in opinion and communication. A review of this sort cannot cove" 
all of this material and its organization in the book. Let us merely try 
to touch on a few of the high points, some low points, and then try t° 
draw up a balance. The chapters on the sampling of opinion and th 
mechanics of polling are excellent, better than anything available for 

large class use. They are informative, full, and direct in exposition. 
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too, the chapter on the importance of public opinion, which represents 
a thoughtf ul departure from the arrant romantic nonsense which makes 
of public opinion an infallible demigod. The sections on propaganda 
have an admirable internal organization which takes the reader through 
propaganda from its perception, through the role of personality in 
mediating response, through the learning of response to propaganda, to 
propaganda-induced action. The point of vi 


P view represents, however, 
ittle change from Doob's earlier book on propaganda. Its organization, 
however, is much tighter. 


Now the low points. 
radio are more impoveris 
attention to studies of readership of newspaP' 
book reading has no reference whatever to the major studies in the 
field: the work of Waples and the national survey of Link and Hopf. The 
chapter on radio is good on statistical description of listening, but fails 
to take advantage of the rich literature dealing with gratification stud- 
ies, the "why" of radio listening. In place of reviewing these, Doob 
gives a rather spotty presentation of his own propaganda principles 


asl 
milieu f the book as a textbook is its failure to 
ntext, the major studies which have made 
ing more than a collation of newspaper 
The chapter entitled "Evaluation of 
Polling" is an excellent critique o u lescent extravaganzas 
of the pollsters—in which, unfortunately, various people are pilloried 
for views they never thought they had—but this good criticism is not 
counterbalanced by the pr basic and constructive 
work. Why not some extended treatment of such monographic studies 
as Lazarsfeld, Berelson, and Gaudet's T he People's Choice; © Jones 
of Merton's Mass Persuasion; of Murray 


A ; t ; Uu . 

ok Ao pis of Sentiments ; of Newcomb s d 
i : ikert” Federal Reserve studies; © ink an 

ced y ry Deena why so little on attitude scaling? 


Hopf's People and Books 

Doob Lis in preoccupi imes with his conceptual task that he 
has tended to forget the t nting to his student readers a full 
picture of the substantive content of the field of public opinion and 
propaganda. . . 

The balance unfortunately, is not as high on the plus pem as = 
might have expecte in a work by so talented an analyst as oob: ; D 
still lack an ideal textbook. But, for all its shortcomings, Doobs 
Public Opinion and Propagan i y above the level of previous 
textbook attempts in this field. 


The chapters on the printed media and on 


hed than they need be. There is insufficient 
ers, and the treatment of 


clippings from the Gal 


JEROME S. BRUNER. 


Harvard University. 
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Suerir, M. Am outline of social psychology. New York: Harper, 1948- 
Pp. xv+479. 


This text is intended as an introduction to social psychology and 
presents Sherif's approach to the subject as developed in his researc? 
and as expressed in his earlier writings, in particular, A Study of Some 
Social Factors in Perception, The Psychology of Social Norms, and (with 
H. Cantril) The Psychology of Ego-Involvements. The book is evidently 
meant for students who have had a course in general psychology, since 
it assumes considerable knowledge of psychological terminology and 
theory. 

Sherif defines social psychology as dealing with "the experience and 
behavior of the individual in relation to social stimulus situations 
(p. 1), namely “interpersonal relationships, group interactions and their 
products." These products include "values or norms, language; art 
forms, institutions and technology." For Sherif, the impact of soci 
stimulus situations on the individual constitutes the process of social- 
ization—'‘the central theme of social psychology.” 

Throughout the text, attention is paid to two chief determiners of 
the individual's perceptions, thoughts and behavior: his motives an 
his reference groups. 'The norms of his reference groups, past and present; 
have largely determined his frames of reference comprising his ego-attt- 
tudes (or values). However, at any given time, his deprivations (o 
biogenic and sociogenic motives) are critical selective factors in his pet 
ceptions, thoughts and behavior. 

Sherif holds to the view that social psychology uses the same con- 
cepts as general psychology but "extends" them to the social field. 
Accordingly, he most frequently introduces topics by reviewing relevant 
analyses and research findings of general psychology, then relates the 
concepts thus explored to social contexts. : 

In the main, social psychologists have concerned themselves with 
three major themes: personality structure and. functioning (apart from 
personality development), socialization (i.e., personality development 
and group dynamics (the interaction of individuals in face-to-face 
groups). Sherif places almost exclusive emphasis on socializatio™ 
largely ignoring the other two themes. Nowhere does he venture 4 
unified account of personality structure, though he uses such structure 
concepts as motive, perception, attitude, ego and ego attitude. » 
approaches a unifying concept most closely when he defines the eg 
as a "constellation of attitudes in the psychological make-up of t 
human adult” (p. 252). The nature of this constellation he does ge 
specify. Likewise, Sherif makes no attempt to describe or explain noT 
face-to-face groups are formed, or how individuals interact in SU s 
groups. This is not to say that he avoids mentioning such group? 
Indeed, he talks about crowds, in-groups and out-groups, etc. ape 
reports some of the research studies involving different sorts of gro 
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T " " 
he point is that he does not 80 beyond a very unsystematic treatment 


of enp structures and processes. 
"s. m discussion of socialization, Sherif emphasizes two things: the 
played by individual factors such as food deprivation in determin- 


in 1 , ; 
ng sortal perceptions and attitudes, and the effects of group norms on 
iduals. His account of socialization is more descriptive than ex- 


p In the main, he contents himself with showing that ego 
volvement occurs through social experience, and that group norms 
lated. He brushes aside such ap- 


and individual ego attitudes are corre 
learning theory and 


[decem to explaining personality development as 
reudian theory. While he obviously inclines toward Gestalt “field 


theory,” he does not discuss it systematically. 
Sherif cites quite à number of research studies at length. He in- 


cludes detailed accounts of the University of Minnesota starvation 
experiments, Newcomb’s Bennington study (a section written by 
Newcomb himself), W. F. Whyte’s study of street-corner gangs, and 
the leadership studies of Lewin and his associates. He devotes a good 
deal of space to his own experiments on autokinesis and ego-involve- 


ment. 
The volume is quite uneven both in style and level of difficulty. Some 
of the passages seem to be directed toward the professional psychologist 
rather than the undergraduate student. Many of the concepts use 
throughout the book are not carefully defined. Sherif quotes from 
authorities (notably C. T. Morgan, A. Gesell and E. C. Tolman) more 
frequently than seems necessary in his discussion of the biological bases 
of social behavior. On the favorable side, Sherif makes effective use of 
novel and highly interesting illustrative material, much of it in the form 
of anecdotes relating experiences of participants In World War II. | 
There is virtually nothing in this book on the methods of social 
n while describin i n detail, it 1s the re- 
hods which Sherif stresses . 
nie in social psychology which 
we have all been waiting for, ite. However, Sherif's 
! nO] and provocative, 


approach to the subject is original and his research 
contributions substantial. Many instructors who admire Sherif's 


thinking and research on fra 1] welcome this volume 
since it makes his views more accessible to the undergraduate student 


t ; ] pm 
han his earlier writings- GLEN HEATHERS 


New York City. 
F- quantitative study of the 


Hatsteap, W. . -telligence: 
, Warp C. Brain and intelliget 

frontal lobes. Chicago: Univ. Chicago Press, 1947. Pp. 206. $6.00. 
Within the so-called modern era of scientific re have m 
many attempts to find physical and physiologic s of intellec- 


inquiry the 
al correlate: 
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tual functions. On the physical side, these range from the pendo 
scientific approaches of Gall and Spurzheim, early in the nineteent 
century, to the more refined anthropometric studies of recent er 
some of which have culminated in general typologies. Among ^ E 
physiological correlates sought may be mentioned reaction time; re ks 
time, chronaxie, metabolism, brain waves and others. None, as à sepa 
rate measure, has proved effective or convincing. Yet, somewhere within 
the complex matrix of the living organism must lie the substratum upon 
which intelligent behavior depends for its development. In phylogenetic 
retrospect, the highest level of the central nervous system has seeme 
to many the most likely place to look because of its progressive hier- 
archy of integrative functions. 

This is precisely what Halstead has done. Exercising courage, con- 
viction and patience he has during the past ten years or $0 laid the 
ground-work for an entirely new conception of intelligence. In Bram 
and Intelligence he tells, albeit somewhat dramatically, how he has 
been able to utilize neurosurgical and brain-injured patients to tease 
out factors in a “biological intelligence" by some unique and some 
traditional test procedures. Halstead is a pioneer in a very important 
field of endeavor and he has made substantial progress in opening up 
this area of investigation. He has coupled quantitative, experimenta 
methods derived from psychology and physiology with a judicious 
clinico-pathological insight developed through a period of concentrate 
work with neurological and psychiatric patients. : 

In view of Halstead's extensive and significant work in forging 2 
pathway in a virtual wilderness, this book is somewhat of a disappoint- 
ment. Itis stimulating and provocative, but it is disappointing because 
it does not make clear the nature and significance of the basic concept 
which underlies the work, namely that a “biological intelligence" exists: 
Although it is evident that a great deal of careful and ingenious expert. 
menting has been done, the book is clearly deficient in the portray? e 
the experimental procedures and in the presentation of the experiment 
results. On the other hand, confusing hypothetical issues are given un- 
due space. For example, there is a persistent effort to make the fact? 
fit certain psychoanalytic concepts; this effort is obviously straine 
and without proportionate success. E 

Part I of the book is concerned with the structure of biological P^ 
telligence. It begins with an ambition to define and measure biologi¢ 
intelligence. Although the desire to be clear and straightforward apor 
biological intelligence still persists in Chapter VI with the stateme ij 
“our objective in this chapter will be a definition of biological zo 
gence in operational terms," there is nowhere in the book, a clear ail 
ment, operational or otherwise, as to what is meant by it. The ne 
chapters, or nearly one fifth of the textual part of the book, are i tly 
to identify Freud’s ego with biological intelligence and to dispose 18 
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^! theories of intelligence, psychometric, clinical, and neurolo- 
th Chapter V briefly discusses the background for the present study and 
: e method of selecting control subjects. The principle of “blended 
cr ' was settled upon. These consisted of 14 “normals,” 10 mili- 

= patients with mild psychoneurosis, and 6 miscellaneous subjects 
Or er some psychological or biological stress at time of examination. 
f ne may question this procedure, a5 well as the matter of basing the 
actor analysis upon 50 military head injury Cases, although reportedly 


recovered from mild concussions. 


There is needless confusion t ook in reference to the 


A total of 237 patients and 
as Halstead prefers to call 


control subjects and 
ix D presents raw data 
ients based upon 10 


them) are repeatedly ref 
for only 30 “controls” and 50 neurosurgical patie 
tests. Likewise, the summary implies that the results are based upon 
237 subjects and 27 tests. 
ntal data, Chapter VI 
d of four and one-hal 
lations for 13 tests or 
hese tests as 
determined by the Thurstone method. ther factor analyses Sup- 
plied by Holzinger are not presented. Although these reportedly gave 
four-factor descriptions, the reader is not told how they agree with the 
four-factor solutions DY the Thurstone method. . i 
Chapters VII to X are devoted to a description and explanation of 
the four factors, whereas Chapter XI fits them into the nuclear structure 
of the ego. The four factors are: (1) a central integrative field factor 
(C), described as the organized experience of the individual; (2) an 
abstraction factor ( A) found in grouping to categories and in under- 
standing similarities and differences; o ap r (P), a dynamic, 
energizing factor, said to counterbalance © ilate, 
and Piit further ego differentiation; and (4) a directional factor (D), 
which specifies the sensory modality of experience and the motor out- 
lets—an exteriorization of biological intelligence. | . . 
Part II deals with the represen ion of the basic factors in the brain, 
the way these factors ified by experimental anoxia, 
eli impairment of t 
a aspect has been quantifie 
he impairment index consists of a ten-po i 
allotted for reaching 2 set criterion © h test. The score range is 
0.0 to 1.0. Control subjects ow index: whereas patients with 
frontal lobectomies have he highest indices. Frontal lobectomies show 
a higher impairment index tha: ations in other areas 
of the brain. It should be pointed o" ht of the original 13 


experime 
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tests used in the factor analysis appear in the impairment index. Un- 
fortunately, several errors are to be found in the impairment indices 
of tables 15 and 16. Also in Table 16 there are several instances of the 
calculation of an impairment index based on only three, four, or hve 
of the test indicators out of a total of ten. This does not seem justified, 
particularly when different subjects might thus be compared on the 
basis of entirely different indicators. Appendices À and D provide, 
respectively, a brief history and diagrams of brain lesions for each of the 
50 lobectomies. . 

Although the reader will find much to stimulate his thinking in this 
book, he will undoubtedly yearn for more adequate descriptions of the 
indicators and procedures for using them, for more data than are pro- 
vided in the limited tables, and for more conclusive proof of the four 
factors and their significance and meaning. In general, he will probably 
wish that Halstead had described his tests and results completely, and 
perhaps had left ramifying theory and arm-chair speculation to be 
covered in a separate volume so labelled. 


DonaLp B. LINDSLEY. 
Northwestern. University. 
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ANALYSIS 


INTRODUCTION 

d in omnibus age scales like the Stan- 
nt scales like the Wechsler-Bellevue, 
hibit a measurable range of diffi- 
nses. The scatter records of 
hich the scale is standard- 
of the examined person. 
has been hinted at by the 


i Whether tests are administere 
ord-Binet or in homogeneous poi 
their successes and failures always ex 
culty and a characteristic scattering of respo 
individual cases depend partly on the way in w: 
ized and partly on the personality problems 


The possible clinical value of scatter analysis 1 
earliest experimenters in psychometrics. Its qualitative evaluation has 
gained momentum in recent years, reaching its culmination point in 


m apaponiis publication (26) on diagnostic psychological testing and 
hurstone's volume (33) on factor analysis. 
. The main reason for the psychologist's persistent reoc 

with test scatter is the amount of valuable information it yields as a 
Supplement to quantitative indices of brightness. Personality diagnoses 
often have a basis in the so-called intuitive interpretation of scatter 
whether clinicians are aware of it or not. 

THEORETICAL CONSIDERATIONS 
s of scatter is in part contingent upon the theo- 


the examiner. n the other hand, the system- 
significant to future changes 


preoccupation 


BP in clinical usefulnes 
$ voie viewpoint held by 
in c study of scatter may 
psychometric theory (19). M" 
Two general approaches to the problem of response variations on 
tests are distinguishable. Brody (6), Roe an Shakow (27) Richmond 
and Kendig (17) and those W i sts (32, 34) cling to the 
a that psychometric tests measur . Mental disorders 
reveal themselves, according to these students of scatter, through their 
"M eel effects on cognitive processes measured by tests. Those engaged 
factor analysis (33, 37) hold similar views In the interpretation o! 
tors to be organizational 


8roup factors. They consider the extracted fac 
177 


ly contribute 
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features within the sphere of intelligence. Though non-intellectual per- 
sonality traits and group factors are being mentioned together with in- 
creasing frequency, their relationship is rarely clarified to the satisfac- 
tion of the applied psychologist. 

Bijou (5), Jastak (13, 14, 15, 16), Piotrowski (22), Rapaport (26), 
and Schafer (29) either imply or express the view that intrinsic per- 
sonality traits may be appraised through test scatter directly without 
the medium of intellect. Speaking of signs of invalidity, Piotrowski 
(22) finds that some test failures may not be regarded as legitimate 
symptoms of lack of intelligence. Instead, they are expressions of unor- 
ganized or disordered personality functioning in the cognitive, affective, 
and instinctual areas of behavior. Rapaport (26) and Schafer (29) 
point out the dynamic character differences reflected in the relation- 
ships between scores of various homogencous scales. Jastak (16) finds 
that intelligence accounts for only 20 to 25 per cent of the variance O 
any one test and that the remaining variance must be accounted for 
by attributes and factors independent of intellectual level. 

The differences between these two schools of thought may not aP- 
pear to be great. Members of both groups accept scatter analysis as ? 
major clinical tool for the understanding of individual differences. How* 
ever, the personalistic approach probably assumes less and explains 
more. Every psychometric response has some sort of clinical validity: 
Whatitis valid for may not be accurately determined by a priori assump” 
tions but by actual analysis. The traditional pars-pro-toto theory ° 
intelligence artificially limits the experimental horizons of applied psy" 
chology. It ignores the Gestalt principle that every human act is the 
result of the total personality complex. If such traits as introversion: 
aggression, compulsion, suggestibility and instability affect human aur 
justments, it is unthinkable that they do not have a direct influence 
on test scores. . 

The intelligence theory often falsely identifies cognition with intelli- 
gence, confusing an overt act of behavior with a scientific abstractio™ 
It also leads to the juxtaposition of intelligence and emotion which agai" 
fails to heed the distinction between theoretical constructs and com” 
crete mechanisms (8). Furthermore, it delays the experimental eor 
cation of the theory of functional unities or independent traits whic? 
are likely to be productive of more fertile ideas in psychometrics ¢ a 
has been the rigid and monodimensional intelligence hypothesis. 

Let us assume, for the sake of argument, that behavior is deter” 
mined by two traits: (1) degree of intelligence, and (2) degree 9 


sanity. If, upon further study, it is found that all degrees of intelligence 


————— € 
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occur at a consta i 
a constant mei ie à ue E ns bios. iiie 
sine KOA ps E , then the two traits are for all practical 
effect ee . If this be true, then intelligence has no direct 
vusiisene ie and sanity has no direct effect on intelligence. High 
Readies eo no prevent anyone from becoming or being insane. 
fini Pus D AN € person to lose his intelligence. Low intelli- 
aimi v es : ociate with a high degree of sanity. To borrow a 
tate rect or analysis, intelligence and sanity are placed orthogo- 
tise, dee : other ina random sampling of the population. Neverthe- 
tenies pe sanity profoundly influence all adjustments simul- 
ie os re i erentially. Both may be positively correlated with 
bein. orrelated with each other. It is probable that scatter var- 
extent and nature with the combination of the two traits in each 


individual. 

vid Objective evidence for 

Th ed. The results of scat 
e limitations of tests in t 


such a conclusion has so far not been pro- 
ter studies are contradictory and confusing. 
he detection of mental disturbances, dis- 


e 
irse Magaret and Wright (20), may not be inherent in scatter 
uni a e but in the methods and theories employed in its meas- 
tates of A sychometric science may successfully catch up with the dic- 
dicate d eie: sense if its scope of inquiry is extended without preju- 
Then asic qualities of the personality. 
apne umber of traits directly measured by tests are more than two. 
even hes probably different from those used in our simplified example 
tear! ough these two variables form the central point of interest of 
Staats all investigators of scatter. The value of mental tests will be 
to "m enhanced as soon as we stop deciding beforehand what we wish 
i asure and confine ourselves to the empirical investigation of what 


ls 

pens being measured. 
of dran process of developing a quali 
Teme swi analysis certain precautions § 
the sg? of the pendulum from a rigid ar 
techniqu and vague hypothesizing such as is tyDIC 
Objective d; Scatter analysis may 
Culties a iagnostic aid if certain 
to the di re overlooked in its study- 
analysis iscussion of the most salient 


quantitative system 
hould be taken to obviate ex- 
d and faulty statistical base to 
al of the projective 
uirements of an 
| pitfalls and technical dif- 
ages will be devoted 


ciated with scatter 


tative and 


theoretica 
The following P. 


problems asso 


ND SCATTER 
exists in a cer 
ttering attes 


SCALE STANDARDIZATION A 


Th 

orndi " : 

e UN once said that whatever exists, 
rous attempts at quantifying sc@ 


tain amount. 
t to the ac- 
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curacy and wisdom of his observation. If scatter is orderly (16, 26; 29) 
and represents some consistent traits in the examined individual, then 
such traits should ultimately be measurable through indices of response 
variability. Despite the discouraging results of scatter analysis on the 
“Stanford-Binet (11, 17, 27), the process is being repeated on an even 
larger scale with the Wechsler-Bellevue tests. 

The Wechsler-Bellevue Scale has several important advantages over 
the Stanford-Binet Scale. It measures eleven separate abilities in fairly 
well-graded sub-scales. It has intra-test homogeneity and inter-test 
heterogeneity. This cannot be said of the Stanford-Binet Scale. In fact, 
the latter test measures with fair consistency only one type of ability 
which is closely related to Thurstone’s (33) verbal group factor. 

According to McNemar (21), all items of the Stanford Scale are 
saturated with one general factor. This factor is sufficient to accoun 
for most of the inter-correlations between items except at four specifie 
age levels (2, 22, 6, 18). McNemar also finds a motor factor a? 
memory factor which may be responsible for some test variability s: 
certain age levels. He warns, however, that the sampling of tests 1" 
each of these areas is highly unreliable, and therefore no sweeping PF, 
nouncements concerning special defects may be made on the basis © 
scatter. McNemar's analysis clearly indicates that the Stanford-Binet 
Scale lacks the much desired inter-test heterogeneity. It would have 
been a superior diagnostic tool if substantial samplings of many different 
abilities had been included at all levels. 

i The high correlations of most items with the general factor may b° 
said to be spurious because of the biased ability sampling of the $C? ik 
What McNemar calls a general factor may in reality be a combinati? 
of several factors. The differences in common factor saturation petwee? 
the various items are sufficiently great and consistent to posit the exis 
ence of a verbal group factor in addition to the general factor. TES 
hidden group factor, though as constant and stable as the general fac K 
is responsible for a great deal of scattering on the Stanford-Binet Sca 
in abnormal cases. A 

From our experiences with the Stanford-Binet Scale it may be cora 
cluded that attempts to assemble test items which are highly correla? of 
with each other and with the whole scale should be abandoned in on 
of less correlated but clinically more valuable items. A saturation vl 
efficient of over .60 with any factor should be viewed with considera 
suspicion, as it probably represents a statistical distortion produce 
improper test samplings or by criterion contamination. sons 

If the theory of multiple traits (16, 33) holds, then high correlati? 


ij 
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e little more than standardization 


between tests and factors or traits ar 
tests and traits, whether gen- 


artifacts. For the relationships between 
eral or specific, are mediocre at best. 
Nee x centroid verbal factor is so prominent and extensive in the 
xil Esdr inet Scale, it 1s likely to misrepresent the mental levels of 
intelligent persons who are inferior in this dynamic trait. This is 
one of the most serious shortcomings of the Scale. The 1.Q.’s obtained | 
from such a scale should be used with utmost caution in all diagnostic 
work with problem children. 
: High saturation of tests with factors supports the frequently recur- 
ring idea that there are pure tests of this and that factor (8, 37). Ey- 
senck, for example, uses the Raven Matrix test as à measure of intelli- 
gence because it is supposed to be a test of pure “g” This method of 
measuring intelligence contradicts Eysenck's very poignant argument 
that each test is the result of at least four factors, one general, one 


centroid, one specific, and one error factor. The theory of manifold 
nent that it should relegate the 


Causation is so reasonable and perti it s 
idea of pure factorial cal myths. The re- 


viewer suspects that th 
group factors in addition to the general fac 


test is a relatively pure test of "g" mu | 
mental design. The same may be said of tests which are supposed to 


be pure measures of some group factor (37). The existence of pure tests 
of either general or grouP factors cannot be reconcile 
cal realities in clinical practice- ' 
One of the greatest, and as yet unsolved, problems of the pr 
mental laboratory and cli ordinate interprete e of t 3 
results from many differen t (26) and Eysenc (8) ra 
oe large number of different tests and techniques without = = 
direct comparisons between them. This procedural a ES ei 


break i i i ificial and over 
s up the diagnostic study into artifici 
but also retards the final determination of what the various tests meas- 


ure and how well they measure what they do- When the Bellevue and 
Babcock tests (26) differentiate between neurotics and schizophrenics, 
they may not necessarily do it in the same manner. The factors in- 
Volved may not be identical or even similar. When tests of nos 
Social] history, persistence choline esterase, and body bui : i ae 
entiate between Eysenck’s (8) hysterics and dysthymics, : de i. 
follow that all these methods are good an valid tests a ae 
temperamental factor as Eysenck seems to imply. The way nee ie 
what is being measured by all these tests is to administer them to 


to faulty experi- 


nic alike is the co 
t tests. Rapapor 
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same experimental population and to a control group and then subject 
them to an over-all factor analysis. In this manner, the similarity OF 
dissimilarity of the analytic criteria would soon become apparent. It 
would also facilitate the discovery of several differential signs based not 
on the nature of the testing procedure but on the nature of the psychic 
traits or genotypes measured. 


FourFOLD ANALYSIS OF TESTS 


Psychometric batteries composed of a large variety of independently 
scaled, homogeneous abilities yielding separate scores are conducive to 
a fourfold analysis of test variability. 


First, they permit the comparison of one individual in any ability with the 
norm of many other persons of the same age in that ability. This inter-persona" 
index is similar to the I.Q. except that reliable scores obtained from several dis- 
crete test functions increase the analytic possibilities. They provide a mult! 
dimensional system of comparison. à 

Second, homogeneous subscales measure the intra-personal organizatio 
functions which are directly and intimately related to dynamic personality 
traits. The discrepancies between scores within each individual record are 9 
great diagnostic importance (4, 5, 8, 16, 17, 23, 26, 28, 29, 30, 31, 34). d 

Third, the response variations of a person within each subscale may be use 
in the interpretation of test results. Two persons obtaining identical results 0” 
an information test, for example, may have arrived at this score in à tota A 
different manner. Indeed, identical score patterns within a scale are 4 rar 
exception in records with identical final achievements. Eysenck (8) attemper 
to measure intra-test scatter in relation to the Matrix test. Because of t 
method he used and because of the lack of direct inter-test comparisons his er 
sults were negative as far as differentiation between his diagnostic group kom 
concerned. The reviewer (16) believes that intra-test scatter may contribu d 
to the understanding of personality dynamics. It is to some extent relate 
to inter-test discrepancies. It defines the nature of inter-test variability- 

Fourth, a test response may be analyzed in regard to several aspects bud za 
have so far not received sufficient objective evaluation. They are, howe? g 
given some attention in Rapaport's (26) and Schafer's (29) writings. Reach a- 
times, rate of production, amount of output, contents of response, form 9^... 
sponse, nature and number of errors preceding the solution, and pro) 
qualities deriving from emotionally tinged, personal experiences are impor 
factors in all tests. These facets of a response undoubtedly influence the c ci 
cian's judgment even though they receive no methodical statistical analy? as 
That such an analysis should ultimately be made to keep diagnostic inferen 
within reasonable bounds is self-evident. It is precisely these unobjectifie far- 
intangible qualities of a response that lend themselves to the making of a 
fetched and exaggerated claims. Clinical rationalizations may be kept ces 
minimum through the adoption of multiple scoring techniques which, if ne rac" 
sary, overlap some or all of the subscales of a battery. The appraisal of eie. 
teristic and psychologically uniform response deviations appearing inter 


n of 


hich 


jini- 
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J tently th 
4 hroughout the battery may lead to the isolation of a number of helpful 


diagnostic signs. 
Like life i " 
"Wo cadens tests are neither fully structured nor unstructured. 
| i ei ta ements are liable to occur on all tests. A vocabulary test, 
y standardized, may be equally as projective as the Rorschach 


| test (18 ~ 
esi : Furthermore, the measurement of projectivé qualities by 
of vocabulary and other psychometric tests is likely to be more 
e similar measurements from so- 


| De more meaningful than ar c 
Hie sah nm tests. A change in the theoretical foundation of the 
Sounite {adic test will produce more satisfactory diagnostic and ther- 

ibe found icators than does its replacement by tests which elude scien- 
atment. 

his Bs Wechsler-Bellevu 

ini (14, 15) may be a valua 

Bisbee y precautions are aPP 

aA ems of scatter measurement. 

as Nido are great and numerous. 
asa uiu as the final test scores and 
quotients. 

P. ind psychom 

Re Hra of scatter t 1 

atal e this is done, psychologists wil j 

ysis as an after-thought rather than as 2 pri 

IA OF VALIDATION 

tally ill or disordered persons 

s come from psychiatric 

find them evaluated in 

4ons may be raised to 


e Scale (34) or any similarly constructed bat- 
ble tool of multiple response analysis if the 
lied to the theoretical and technical 
The potentialities inherent in such 
Variability studies are certainly 
more informative than intelli- 


etric scale should, indeed, be standardized for the 
chievement norms. 


in addition to providing a 
| have to be satisfied with cluster 
mary diagnostic aid. 


EXTERNAL CRITER 


e often observed in men 


Scatter is mor 
t scatter studie 


than i $ 
di? in normals. Since mo$ 
re and mental hospitals, 

ms of psychiatric disease entities- 


s 
e a procedure. First, attendance at ent to à hos- 
al does not make a person abnormal and, vice Versa: not being à 


oe patient does not make a person normal. Second, psychiatric 
assifications of mental illness are mono-dimensional and therefore do 


n : oe 
ot fully agree with all the facts in the given case. Psychiatrists gener- 


a à 
dy consider their present-day nosology outdated and unsatisfactory. 
great deal of research wi to objectify the syndromes 


associ Il have to be done to ‘ Bru 
oo with mental illness and persona e. : iaa 

i iti r s 
ence of disease unities SUC iatric diag 


nos ; : 

* esos experimental verifi 
! fluid e criteria differentiating One illness f 
- The diagnosis therefore depends largely 0? 


cation. 
rom another are vague and 


the school and sub- 
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jective bias of the individual psychiatrist. Some students of scatter 
analysis (8, 16, 26, 29, 34) are fully aware of the implications inherent 
in the use of comparative standards of low or dubious validity. There 
may be substantial agreement between psychometric findings and 
psychiatric evaluations outside the reference frame of official nomencla- 
tures. However, most scatter analyses are based directly on official diag- 
nostic categories. The outcome of such studies hinges as much on the 
objectivity of the psychiatric grouping as it does on psychometric 
analysis. i 
This dilemma may be overcome either by abandoning the practice 
of validating scatter against psychiatric diagnoses as now conceived 
and employed, or by obtaining from the psychiatrist a comprehensive 
evaluation of the adjustment pattern of each patient. Another method 
of improving validation criteria is to give greater weight to the intrinsic 
relationships of test scores obtained from the experiment. Chemists, 
physicists, and astronomers depend almost exclusively on criteria of 
internal consistency set up as a result of well-planned laboratory pro- 
cedures. Objective psychology has reached a stage at which criteria of 
internal consistency may be of considerable service. For example, the 
discrepancy between drawing and arithmetic Scores 
criterion of mental adjustment than any extraneous cri 
actual life developments of the individual case. The 
should be checked and studied by the experimenter 
complicating the issue by seekin 
value. There may, 
test criteria and livi 


is a more valid 
terion except the 
se developments 
directly without 
g opinions and impressions of inferior 
in fact, be a closer agreement between psychometric 
ng realities than is generally assumed to be the case- 
Psychologists will make their greatest contribution to psychiatry when 
they emancipate themselves from current psychiatric systems of diag- 


noses and evolve a rigorous and multiple system of comparison between 
integrated and unintegrated personalities. 


REFERENCE POINTS 


The phenomenon of response scatter is a universal one. It has 
many degrees and many qualitative variations. It occurs in persons of 
all ages and types. The general assumption is that scatter is greater in 
those with abnormal than in those with normal personalities. This as- 
sumption will be found to be Correct when tests are standardized for 
scatter on the basis of criteria of internal consistency. One of the 
simplest and most direct measures of scatter is the range ratio. It is 
obtained by dividing the lowest score from a large number of subscales 
by the highest score from the same tests (16). The range ratio of the 
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Wechsler- i 
ae 3 "euis test is considerably smaller in all abnormal groups 
e normal population. It indicates that disturbed and im 


organiz i 
ed people are more variable, less dependable, and less compact in 


most of thei 3 

of the sex "à ustments (16). The range ratio is an objective measure 

distinguish betw ken wulignnerdi-xi E HIDU Bru 
Whenvation wc ifferent types of scatter within each record. 

OF RIE A: testa P » made to determine the scatter clusters between 

iso" nin e choice of a reference point from which scatter is 

(ites, GE ia at es important. The psychological and statistical sound- 

nude um m erence point governs the soundness of the final scatter 

du dits amg have statistical stability and psychological relevance. 

Hee R. s, Ep: be clinically valid and statistically reliable. Its 

aimi ind quently a function of the personality theory to which the 
[rm - subscribes. : 

vf test mens E reference points have so far been used in the study 


T Deviations of test scores fro 
a aa to be a good test of origina 
, 2) and Eysenck's Matrix test scat 


m the stable score of a test which is usually 
lintelligence. Babcock's vocabulary scatter 
i E ter (8) are two examples of this type. 

ear pe of scores from the mean of all tests or of a group of tests. This 
of the most popular methods of scatter analysis. It has been used by 


We 
up (34), Rapaport (26), Schafer (29), and many others. 
. Factor or cluster analysis in which the mean of one cluster is compared 


With 

pd means of other clusters of tests. Thustone’s (33) group factors and 
s (16) altitude scatter belong in this category. 

ctical value of these three 

d experimental fact. 


L vr. 
the fios us scrutinize the pra procedures in 
ght of clinical experience an 


Vocabulary Scatter 
lary scores with other test 


f the most stable rating used 
It is also relatively invulnerable to the 
es such as are observed in psy- 


f the vocabulary score has been 
f tests as a method of meas- 


comparing vocabu 


The advantages of 
lary yields one o 


s 
nin 2r two. The vocabu 
posee. ometric examinations. 
Chotic e of disorders and disturbanc 
noted denies This refractoriness 9 
uring titeln after the introduction O me s 
Manner fo igence. Wells (35) was the first one to use it in a systematic 
purposes of measuring the variability of mental patients. 


ab 
‘ihe ce (3, 4) has elaborated the relationship between vocabulary and 
tests into her well-known " theory. The vocabu- 


ary test, bej 1 “Jevel-efficiency | 
ex of na emg uninfluenced by mental disorganiza 
Teduced ian capacity, while the highly sensitive € 
y mental disturbances, serve 


tion, serves as an in- 
fficiency tests, being 


as measures of sanity. The rela- 
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tionships between the two types of tests may thus be employed to deter- 
mine the degree of deterioration. Babcock (1, 2) has provided experi- 
mental evidence for the partial correctness of her theory by testing it on 
patients suffering from general paresis and dementia praecox. 

More recently, Rapaport (26) confirmed the diagnostic value of 
vocabulary scatter by using Babcock's tests in clinical examinations. 
He states "These statistical data then substantiate the contention that 
Vocabulary is the least variable and one of the best retained of the sub- 
tests; thus it might well serve as an indicator of the original intelligence 
level, and as a standard of comparison for the other subtests." The two 
premises underlying this method of scatter analysis are that vocabulary 
is a good test of intelligence and that its score is insignificantly depressed 
by mental illness. Though vocabulary has been shown to decline some- 
what with age (34) and to decrease significantly in cases of organic de- 
terioration (6, 7), its relative invulnerability has been sufficiently docu- 
mented by actual test throughout the history of clinical psychology: 
The stability postulate of vocabulary scatter might, with some reserva- 
tions, be clinically acceptable. 

The assumption that the vocabulary is a superior test of intelligence 
depends on the definition of intelligence and on actual correlations be- 
tween intelligence and vocabulary. Since a general theory of intelli- 
gence has so far not been evolved, the postulate that vocabulary gives 
valid measures of intelligence remains experimentally unverified. Jastak 
(16) finds that the vocabulary is no better a test of native capacity than 
is any other well-standardized test. If this is true, then vocabulary scat- 
ter should prove quite unsatisfactory and even misleading in a large 
number of mental patients. It is imperative that the reliability of a score 
not be confused with its validity. The Wide Range Reading Test (13) is 
even more stable and less vulnerable in mental examinations than is the 
vocabulary. Yet we would not dare suggest that oral reading is a valid 
test of pre-psychotic intelligence because no test can be credited with 
such unusual powers, no matter how stable its score may be. This criti- 
cism applies in even greater degree to Raven's Matrix test used by 
Eysenck (8) as a measure of intelligence. 

If a person has a defective vocabulary despite good intelligence: 
then the difference between vocabulary and efficiency tests may be 
positive or may indicate a favorable degree of personality organization 
even though the person is severely disorganized. Furthermore, an indi- 
vidual may have an efficiency score of 120 and a vocabulary score of 
75 before he becomes psychotic. During psychosis the efficiency score 
drops 40 points (from 120 to 80), whereas his vocabulary score remains 
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75. 1 
ae righ als all "e - clinical practice) the vocabulary 
eee alti a measure of € ficiency for the simple reason that 
renee hs exi measure of original intelligence. It is incapable of 

Fond decline in efficiency which has actually occurred. 

uice eer or’ ae 
E SO catiente: 4h fl ri t s catering to a highly 
ict aee he e s. t fails to distinguish between integrated and 
onsite a ed ehavior in the majority of cases treated in clinics and 
The re Sw hich draw their patients from all segments of the community. 
af dicc (14) has found that the magnitude and the distribution 
ide abulary scores among mental patients are similar to those of nor- 
by DR This would indicate that vocabulary is indeed uninfluenced 
that rie disturbances. This does not, however, entitle us to conclude 
ede: he vocabulary test always furnishes an accurate classification of 
es intelligence. Mental levels based on vocabulary scores are 
C curate and valid in not more than 10 per cent of mental patients. 
lin icians will, therefore, do well to use the vocabulary test with utmost 
Caution in the determination of intellectual level and also in the deter- 
mination of efficiency by means of vocabulary scatter. 
Babcock (3, 4) has advanced psychometric theory by adding the 


fficiency dimension to the intelligence quotient. This extension of the 


usefulness of tests is valuable but it does not go far enough. The human 
Personality is multi-dimensional. The most complete system of psycho- 
fore be manifold. As Rapaport (26) 


Metric interpretation must there 
Puts it '*. . , Vocabulary Scatter ..- is of little value in gauging the 
Seneral spread of scatter over the scattergram. Our approach to scatter 
analysis is to consider the relation of all subtests scores to each other. 
Onsidering scores in relation to the vocabulary level is only one aspect 
9f this approach.” A multi-dimensional approach is not limited to 
points on the 


One roi 
ue rigid reference point, but employs many reference poin 
Dasis of their empirical value in test analysis. The recognition of this 


op portant tenet will obviate the elevation of one test to the pedestal 

ice reference point (2, 8) since it limits the possibilities of 

in scatter analysis through the study of many other relation- 
Mean Scatter 

d clinical featu 

the deviations © 


mg isi i i res of the Bellevue 
S ppraising the diagnostic an 
Soe Wechsler (34) has found that t f the germ peria 
to the from the means of all scores bear certain consistent ations ips 
indi € disorders of psychiatric patients. Wechsler uses differences o 
‘vidual test scores from the means. Rapaport (26) and Schafer 
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. * h- 
(28, 29, 30) employ a similar method of siqui cipia Bene dco 
nique is, of course, more subtle and more versatile poris ie 
scatter. It comes close to a multi-dimensional system and sa neg 
some extent the criterion of psychological polivalence of test we " . 2 
hould prove serviceable if the mean of all scores meets the validity 
ne ser of analysis. The mean of all scores is undoubtedly a very 
Side reference point. Whether it is psychologically relevant ie ches 
certain. Its relative constancy is an asset, its psychological am ga det 
aliability. The mean of all scores often acts like an unwanted mide a 
man. It cuts the real discrepancies and disregards the direct d 
ships between tests. The difference of 9 points between the weig poi 
score of 12 on the vocabulary test and the weighted score of 3 pe ke 
arithmetic test may be more meaningful than are the two scores o 
—5 from a mean of 8 points. ; 
a difficulties of mean scattering increase when the mean is ~ 
to the extremes of the distribution of individual weights. In such cas 3 
itis a highly contaminated statistic. It neutralizes the very idit 
which are of diagnostic importance. An acutely psychotic or sanot 
disorganized individual may have a low mean score as a result yen 
personality defects. The actual discrepancies from the mean, whethe 
cumulated or not, may give no evidence of a disturbance when in reality 
it is very severe. The more generalized the mental disorder the - 
efficiently does the mean differentiate between normal and abnorma 
functioning. The general thesis may be adopted that the use of € 
reference point for the measurement of a trait is impractical if the tr2! 
in question has a variable effect on that reference point. If the mean id 
all tests, like the vocabulary, were unaffected by personality impait 
ment, it could well serve as a reference axis of scattergrams. Since this 
requirement is never satisfied in clinical studies, the mean of many tests 
may not be regarded as a psychologically adequate pivot about which 
scatter varies in a consistent manner. Like the vocabulary test, it ba 
a great deal of reliability but little validity for the accurate apprals# 
of scatter. The vocabulary score is relatively uninfluenced by mental 
disorganization, but it is not a valid measure of original intelligence 
The mean score is an invalid index of original intelligence because it 1$ 
influenced by mental illness. 
A third flaw of the mean is that the various types of mental com 
ditions have different effects on it in different persons. It may thus 


give an accurate picture of a trait in one individual, but obscure the 
very same trait in another individual. 


Tene 
gard to the clusters in whi 
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Cluster Analysis 


Itisi " 
deing seen d realized that clinicians and factor analysts are 
deias ier i be different ways. When Babcock (3) decided to 
Raae a attery into motor, learning, and memory tests 
grouped the subtests of. Piara e bebes (00 
patie. f cale into verbal and pe 
Sisi. c bee the two types of tests measured Hr 
ee ok n À ) divided the scores of one cluster of tests by the 
n a er € uster to differentiate between schizophrenia and 
age d sive psychosis, he applied a clinical method of factor analy- 
fem ES dir pn (26) eliminated the digit span and arithmetic tests 
indes mil " scatter measures, he employed a legitimate method of 
sium — sis which could have been readily confirmed by Thur- 
^ roid method of factor analysis. 
Wa ice qe n (8, 16, 33, 37) accomplishes by statistical means 
BE karda vely short time what the clinician discovers after many years 
ael won experience. Even then, the clinicians’ hunches may be in- 
by f e and unrefined in comparison with the differentiations supplied 
actor analysis. That is why Thurstone's group factors may be 


RES 

Bj eri another form of scatter analysis. 

reduces pi analysis has the advantage over other systems in that it 

smaller n ne DE infinite complexity of test interrelationships to a 

fien to et er of psychologically pertinent unities. It directs our atten- 

are fact e weighty fact that scatter analysis has as many facets as there 

xar: Scatter may be large in some tests and small in others 
x" ing on the personality structure of the examined individual. 

Sine s we learn to differentiate between types of scatter within the 

| test battery, we are likely to end uP with few or no positive signs 


espi > 
Spite Herculean efforts- When scatter scores are averaged without 
ch they occur, their diagnostic significance 


ma ^ 

j^ Sii La in the process of analysis: 

1 » strong points of factor analysis may 
Ous ists, an objective and efficient means of grouping psy¢ 
2. It; " 

provide: H : le 
ated dues. a highly differentia 

t supplies several stab! 


Cal 

1 "eaningfil, 

Such vs shows that one sub-test may belong to two or more ciu à 

Possibitne: should be used in the appraisal of more than one trait. This latter 
ity is especially difficult to g2"E* by mere clinical im 


be summarized as follows: 
hologically homogene- 


isol ted system of scatter analysis within the 
le and important reference points that are clini- 
clusters and that 


pressions. 
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The clinical application of factor analysis has been limited in the 
past because of several shortcomings in the interpretation of factors. 


1. The naming of factors after test contents does not satisfy the clinical 
psychologist who thinks more in terms of personality dynamics than in terms 
of test materials. 


2. Factor analysis has not been able to provide factorial scores which are 
actually uncorrelated with each other. For example, the number factor and the 


or more general factors which are responsible for the positive correlations found 
between group factor scores. Unless these general factors are accounted for 


the following methodo- 


A - nd measurement of the 
general factor of intelligence; (b) the isolation and identification of group fac- 


amics; (c) the study of the relation- 
ors; (d) The empirical demonstration 
s from intelligence are independent of 
; and (e) the empirical demonstration that the devia- 


are independent of each other. 


re of each one of the 11 Bellevue sub- 


3 d score to equate the achievements O 
different persons on the same test. The weighted score is in turn trans- 


muted into an intelligence quotient. The weighted score and the in- 
telligence quotient are statistically equivalent for they are both stand- 


ard anim Psychologically, they differ significantly because the 
weighted score disregards age whereas the quotient does not. The 


Wechsler quotient is thus a convenient method of keeping age relatively 
constant. Scatter patterns are more accurate and more meaningful 


As is well.l 
tests is converted into a weighte 


à 
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when expressed in terms of quotients rather than in terms of weighted 
scores, 

Rapaport's (26) 67 schizophrenics have a mean age of 31 years; his 
33 depressed patients have a mean age of 49 years. The depressives are 
9n the average 18 years older than are the schizophrenics. Rapaport 
finds that “No schizophrenic group, not even the Deteriorated Schizo- 
Phrenics, show such a consistently great impairment of all Performance 
Subtests as the two Depressive Psychosis groups. This scatter pattern 
1S of crucial diagnostic significance for differentiating Depressive 

Sychoses from Schizophrenia.” 

If age is kept constant by the use of test quotients, his depressive 
SrOups fail to show the consistently great impairment of performance 
Subtests, When all the precautions discussed in this paper are applied 
to Rapaport's cases, the average performance scatter score (deviation 
Tatio from capacity) for his 44 most chronic and deteriorated schizo- 
Phrenics is 82.9. The average deviation ratio for his 33 depressives is 
82.2, The difference is insignificant. Both scores are significantly below 

€ average (normal) scatter score of 87.4. Rapaport's deteriorated un- 
Classified schizophrenics (N =7) have a scatter score of 74.7 which is be- 
low that (78) of his most disorganized depressives (N =8). Rapaport ni 
Crucial scatter differences are more a function of the age differences 
tween the diagnostic groups than they are of the groups themselves. 
Oster (9) has shown that when weighted scores are corrected for age, 
the originally diagnostic scattergram turns out to be an artifact. The 
test discrepancies either become smaller or disappear when age is kept 

Constant, . s 
It may be added that the performance deviation ratios of Rapaport's 


Eroups are generally high and the discrepancies rather ie Tes 

“PProach the norm of the population at large much more boos fol E 
9 the ratios of hospital patients studied by the pea = oe 

“xplained in two ways. Either Rapaport's cases were n inae T 
Jan are psychiatric patients generally. Or, the diminished sca 


to tł : . i igh average and superior intelli- 
ne fact that his patients were of hig fica stutiy-6l desc 


Hace. High capacity as such is not a deterrent in Am entea 
eWever, our experience indicates that the i € veneni 

actor f lity range. It does no 1 
Y at the extremes of its ability rang Bis tee aupodoritg. Noè 


ini Sufficient opportunity to chips ses alu Hie tol. bral ei 
i i i nce à 
| aes ee ai dq = Mie Scale is a distinct weak- 


Stupid; 
ty. This telescoping effect of the amr 
"m in all lveetipartons of scatter. Patients are — fro 
" because of the limitations of the scale at both ends. 
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f 
Any scatter analysis of the tests of the Bellevue Scale e 
weighted scores is of dubious value, unless the experimenta o 
groups are homogeneous in age or unless test quotients are us 
for age differences between members of such groups. 


Sex DIFFERENCES 


It was seen that age differences may produce typical woe ap 
These patterns are of little value in diagnosis if not correcte cant 
Another factor which is likely to disturb the clinical significance v bé 
test analysis is sex. Eysenck's study of test variability (8) is one es 
most satisfactory of all those published in that he always sepa 

two sexes. 
vs As long as psychometric ratings are expressed in averages from qu 
different abilities, the differences between males and females wider. 
tralized. This is especially true if the battery consists of an equa. ie 
ber of tests favoring males and females. As soon as the ác ed 
broken up into subcomponents, significant sex differences may al E of 
in some of the clusters. Scatter analysis will then show disorganiza diss 
where it does not exist. Or, it may show normal organization in nh 
turbed individuals. Sex differences may not be ignored in the ingert s 
tation of various types of scatter. On the Bellevue test, males ten jue 
be better in information, picture completion, object assembly, vii 
metic, and digit span. Females tend to be better in vocabulary, sy™ v 
substitution, comprehension, picture arrangement, and block denm 

Vocabulary scatter may, in the light of these differences, we 
something different in case of women than in the case of men. Ifa Pt 

dominantly female cluster of tests happens to be used in the computat! s 
of scatter scores, it may give a distorted picture of personality func! 9". 
ing just because the patient is a man or a woman. 


5 to 
Future psychometric scales, if standardized for scatter, will have 
be equated for sex differences. 


ARITHMETICAL DIFFERENCES AS MEASURES OF SCATTER 


It is common practice to express test discrepancies in terms of em 
ferences between scores. Thus vocabulary scatter (1, 2, 26) is pne 
measured by subtracting the vocabulary score from the efficiency punt j 
If the difference is positive or small, the subject is assumed to be norm 
If the difference is a large negative one, the patient is thought t? 3 
deteriorated. Weighted and sigma scores are used in the caleulatio a ; 
differences, presumably to render them comparable. In the mean : 2h 
method, the subtest score is subtracted from the mean. Babcock (1; 


t 
Y 
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Wechsler (34), and Rapaport (26) use the method of differences between 
weighted or standard scores. Rapaport (26) mentions the necessity of 
dealing with positive and negative numbers as bothersome, but this is 
not the real difficulty of the method. 

: Wechsler (34) points out correctly that identical differences have 
different meanings at different achievement levels. Thus a difference of 
3 points at the weighted score level of 3 to 6 is twice as great as the dif- 
ference of 3 points at the 9 to 12 level. If this important fact is ignored, 
the findings of scatter analysis from the method of differences are in- 
comparable. It must be stressed that differences in terms of sigma scores 
do nothing to alleviate this problem. 

Differences are positively correlated with the magnitude of the 
Subtrahend. Wechsler (34) overcomes this difficulty by dividing the 
total weighted scores of ten tests by 40. The quotient thus obtained 
indicates the significance of the differences between scores at different 
test levels. When the sum of the weighted scores is 120, a difference of 
3 points (120/40) is significant. When the sum of the 10 weighted scores 
is 60, a difference of only 1} points (60/40) is significant. — 

Jastak (14, 15, 16) has achieved the same results by dividing di- 
rectly the smaller standard score by the larger. One of the characteris- 
tics of ratios is that they are uncorrelated with the denominator used 


in their calculation. The best known example illustrating this principle 
is the intelligence quotient. The I.Q. is by definition uncorrelated with 
the chronological age. If an index of brightness had been proposed in 
terms of the plus-minus differences between mental and chronological 
age, it would have the same unwieldy and misleading properties typical 


of most scatter measures now in use. 

The een d rrene between scores for two persons a ee 
Parable unless both persons have identical subtrahends. m 
difference scatter of deteriorated schizophrenics 1$ not compara 
that of depressives unless both groups have identical yi —€— 

Ratios are generally more useful in the study of the re € p 

€tween tests than are differences (10). They may, for «hari e, be 
conveniently employed in measuring the correlational TU group 
actors when the general factor is kept constant (14, 15, "^ is 
N€ group factor scores by the general factor scores e i: em 
is end, The deviation ratios from capacity of each 3 € Hs 
Feneous cluster are positively correlated with each other. dee 
and, the size of the deviation ratios of a cluster from ar AR 
sorrelated with capacity. Furthermore, the deviation ratios o T ends 
?Y be shown to be uncorrelated with the deviation ratios 0 
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cluster. When the personality trait representing a group factor stands 
in a compensatory relation to intelligence, as happens in some highly 
selected vocational groups, the correlation between deviation ratios of 
one cluster and intelligence (or another cluster) may actually be nega- 
tive. In random samplings, the correlations between general and group 
factors approach zero. : 

That human adjustments are governed by compensatory relation- 
ships between dynamic traits is more than a popular belief. Scatter 
and cluster analysis may become appropriate vehicles for demonstrating 
that beyond the correlational realm of measured abilities there exist 
traits or factors which are either uncorrelated or which, under certain 
conditions, may take up opposite poles in relation to over-all personality 
functioning. 

The scientific proof of trait compensation devolves upon one impor- 
tant contingency. Select groups may not be treated as self-sufficient 
units, but should be compared with the norms of a broader universe" 
the total population. The experimental controls must be random in 
regard to all personality traits which characterize people because 4 
these traits share in the production of scatter on tests. The origina 
sampling must be random at all age levels and for both sexes. 


PsycHOsIs, COOPERATION, AND SCATTER 


Eysenck (8), Rapaport (26), Jastak and Vik (15) speculate about 
the possible relationship between psychosis and psychometric evidence 
of disorganization. The consensus of opinion seems to be that psycho 
metric disorganization is more basic and more far-reaching than is the 
fact of acute psychosis. Abnormal scattergrams may precede the onse 
of psychosis by many years and may continue unchanged following 1€ 
mission. Many psychiatric remissions are surface phenomena unac 
companied by deep-seated changes in personality organization. e 
personality deviations measured by tests may serve as the backgrou” 
for the chronic or intermittent exacerbations in the form of delusion® 
hallucinations, and acute psychotic states. is 

The correlation between degrees and types of scatter and psycbon, 
is not high, as Wittman (38) and Brody (6) point out. There is no > 
reason for assuming that such correlation should be greater than moe i 
ateat the very best. Attempts to prove that psychometric disorgani^ 
tion and psychosis are highly correlated is a futile undertaking. Whe 
ever high correlations between psychosis and psychometric scat 5 
reported, they may be considered the result of ill-designed experimen p 
A psychosis is not infrequently an acute and temporary condition SUP 


B 
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— upon a relatively normal personality. To measure scatter and 
organization in psychotic and deviant persons, the scatter of normal 
po be known. The judicious use of in ternal criteria of consist- 
E || demonstrate that those usually considered normal may not 
essarily be so, and obversely, those considered abnormal may not be 
so either. 
" ds (36) expresses the view that psychometric scatter as meas- 
red by Babcock’s method may be a function of test cooperation on the 
part of the subject. She is seconded in this view by Brody (6). Accept- 
ance of this opinion depends on the definition of cooperation. Inability 
to cooperate is usually a part symptom of the mental disturbance and 
therefore need not cause serious concern as a major factor in scattering. 
Great skill and- patience are required to administer tests to disturbed 
individuals. The results of such tests are fully as valid as those obtained 
from entirely cooperative persons. When the patient is unwilling to 
Cooperate, the test is incomplete and scatter analysis impossible. There 
is little else to report except the fact of refusal and the circumstances 
under which it occurs. The reviewer is inclined to the view that the de- 
gree of cooperation is a very minor factor in the production of test 
Scatter, 
ne results are invalid 
and s only in cases of profound stupor, 
complete mental confusion. 


as measures of basic personality 
seriously reduced consciousness, 


SUMMARY 


ti Scatter analysis is a central problem in all psychometric examina- 
lons. Objective comparisons may be inter-personal and intra-personal. 


Inter-individual norms delimit the standing of a person in any ability 
ion in comparison with other persons in that ability or trait. Intra- 
ano vidual norms may be established by the study of inter-test discrep- 
a My and intra-test response patterns. Inter-test discrepancies require 
withi ematic survey of the relationships between many tests and abilities 
Güm i the same individual. The method of choice is factor analysis 
po. ap by clinical insight and a personalistic approach. 
cholo, € reference point of scatter analysis should be provided by a psy- 
r &ically homogeneous and statistically stable trait. A general fac- 
accurately measured and operationally defined, is the most suitable . 
ey oi Purpose. Neither vocabulary scatter nor mean scatter, though 
constitute invulnerable and stable statistics, 1S clinically applicable 


bulary test may not be con- 


? the 
Bidereg Y of personality traits. The voca lay 
f pre-psychotic intelligence. 


a s 
n accurate and valid measure O 
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The mean of all scores, on the other hand, is often a highly contami- 
nated value which reduces the sought-after variations in a variable 
manner. 

Psychometric scales should be standardized for scatter as well as 
for achievement scores in different abilities. Standardization samplings 
should be random in regard to all measurable personality traits. Exter- 
nal criteria of validation should either be improved or supplanted by 
criteria of internal consistency determined in the course of the experi- 
ment. 

The use of weighted scores or similar measures uncorrected for age 
and sex interferes with the objective appraisal of clinically valuable 
scatter records. The method of differences between scores and a refer- 
ence point should be replaced by the method of ratios of one score tO 
the other. The method of ratios eliminates positive correlations between 
the magnitude of the reference point and the size of the discrepancies 
from it. It makes the deviations comparable at all levels of achieve- 
ment. 

The most far-reaching change in regard to scatter analysis is the ex- 
tension of the value of psychometric tests to the field of general pet 
sonality measurement. The restrictive pars-pro-toto theory of intelli- 
gence should be abandoned. Furthermore, the idea that tests of pure 
factors can be assembled is contrary to the principles of personality 
unity and simultaneity. It may be safely assumed that any ability what- 
ever is the function of at least four types of factors: general, centrolC: 
specific, and accidental. 
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NUMERICAL TRANSFORMATIONS IN THE ANALYSIS 
OF EXPERIMENTAL DATA! 


€. G. MUELLER? 
Psychological Laboratory, Columbia University 


This paper is concerned with some problems that arise in the analysis 
of experimental data and, in particular, with a set of operations fre- 
quently useful in such analysis, the numerical transformation. The 
term transformation, as here used, refers to the operations by which 
one set of numbers or scores is changed into another set. The proce- 
dures of computing logarithms or reciprocals of scores are familiar ex- 
amples. In what follows we shall consider some of the reasons for 
converting a set of data into another form and discuss some of the 
problems that may arise from the application of such a conversion. 

Transformations are used most frequently in three situations: (a) 
the situation in which a quantitative theoretical statement or prediction 
is reduced toa simpler form so as to permit a more convenient inspection 
of the agreement between data and theory; (b) the situation in which 
we seek an empirical equation to describe a set of data; and (c) the situ 
ation in which one frequency function is changed to another to allow the 
application of more efficient statistical tests. 


TRANSFORMATIONS TO TEST THEORY 


The reduction of a quantitative theoretical statement to a simple 
convenient form for test often begins with an attempt to express the te 
lationship in linear form. [It isimportant to note that many expression? 
do not permit such manipulation (see Lipka, 31).] 


1. Consider a theory which demands that, to produce a threshold effect 


the product of the intensity, J, of a light stimulus and its duration, ^ be à 
linear function of the duration (25); that is 
It — a4 bi, n 
inst 
where a and b are constants. In such a case, a plot of the product, Jt, . c 
t should give a straight line with a slope of b and an intercept constant 4- l 
equation [1] may also be expressed in the form 
a [2i 
e) 
een eX" 


equation [2] presents an alternative way of testing the agreement betw 


1 Prepared under Project No. NR142-404; Contract N6onr-271, Task Order IX, be 
tween Columbia University and the Office of Naval Research, U. S. Navy. 
? The writer wishes to express his appreciation to Prof. C. H. Gra 
critical suggestions as to the form and content of this paper. He is also i | 
fessors H. E. Garrett and Joseph Zubin for their comments on the manuscript. 
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perimental m 
a straight d theni a plot of J against the reciprocal of ¢ should give 
Niels ices a a slope of a and an intercept constant of b. Under these cir- 
E iprocal transformation of £ might be used in analysis. 

after-effects of th example, suppose there exists an hypothesis of visual figural 
figure viewed in a, fallosing sont” Assume that any symmetrical inspection 
extent of the area din position builds upa region of depolarization, and the 
exposure of the pe epolarization fora given figure varies as a function of the 
tion nene She PER figure. With cessation of stimulation by the inspec- 
ces e o epolarization decreases in size as a function of time after 
fotice ee docui of a test stimulus after removal of the inspection 
that neis e = interaction of the depolarization due to the test figure with 
elias de i rom the activity of the inspection figure in such a way that the 
poli Thr polarization at a point is the sum of the two depolarizations at that 
TEA E A ce of an inflection point, L, in the resultant distribution of 
Büd the won n is a function of the distribution set up by the inspection figure 

point of inflection, L, is assumed to vary with time after inspection 


ac i 
cording to the following equation 


aL — Le) 
<= - HLL), 


wh : i cs 
ere Lo is the center of the area of depolarization due to the inspection figure, 


[3] 


NM a constant. 
iliac ir our conceptual scheme to observable variables by the assumption 
Propo e displacement, D, of a setting, observed experimentally, is directly 
rtional, by way of a constant, m, to the distance, (L-L), i.e., 
" D = m(L — Lo). [4] 
9m equations [3] and [4] it follows that 
1 dD ED. 
T€ m di m a 
n i ; 
on cancelling the m’s on both sides and rearranging we have 
dD 
<= — kdi. l6] 
0 D 
ni 3 
Integrating, equation [6] becomes 
0] 


log D=- kt C 


Orj , 
its equivalent, 

wh D=ce* 

en i H H 

ate bh the constant of integration, and ¢ its antilogarithm. We may evalu- 

Of the Y noting that when / is zero, € is equal to Do, the value of D at the start 

Process, Thus, 
[8] 


D = Dee*} 
s a serious statement of the- 


eory testing; no claim 


3 Th 
. Je reader should not consider the following account a 
ckground, see Kóhler 


Y; it nt as 
» made 4 presented to exemplify the use of transformations 1n th 
and Walls, to its precise value for perception. For theoretical ba 

ch (Proc. Amer. Philos. Soc., 1944, 88, 269-357) 
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and our theory states that the magnitude of the displacement of the equal 
setting is à negative exponential function of the time interval between pee ae 
of the inspection period and moment at which the setting is made. By ta 

the more convenient common logarithms of [8] we observe that 


logio D = logio Do — 0.4343kt. 19) 


Since equation [9] is linear in form, we may test the agreement between ci 
and theory by plotting logio D against t and observing whether the experimen 
points may be represented by a straight line. The slope of such a plot is 0- dn 
k and the intercept is logi D, Thus a logarithmic transformation of the de 
pendent variable may be used conveniently in the present analysis. 


DISPLACEMENT (MIN. VISUAL ANGLE ) 


OQ 20 40 6O 80 120 140 


TIME AFTER INSPECTION (IN Secs.) 


" 
mod 
Fic. 1. Hammer’s Data [TABLE I 1N (24)] on MEAN DISPLACEMENT OF "EQUA 
SETTINGS AS A FUNCTION OF TIME AFTER INsPECTION PERIOD. 


100 


Hammer (24) has investigated the magnitude of the displacement of 


me 
as a function of time after the inspection period, and Fig. 1 shows a plot of i 2 
of her data (her Table I, data for Group Mean, Right Displacements): f the 


shows a semi-logarithmic plot of these same data. With the exception © ht 
point at 180 seconds, the relation in Fig. 2 may be represented by (OR of fit 
line. The deviant point at 180 seconds is not far removed from the line ^. Md 
(calculated from the straight line plot) through the original data of m 
lies well within the limit of variability found by Hammer (about 1.4 ™ and in 

3. Of course, not all problems are as simple as the examples. givent pment 
many cases à major difficulty in testing a quantitative theoretical is lution 
arises from attempts to determine a transformation appropriate z Medling of 
of certain constants. An example of increased complexity in the 9%- 


e 


[^^ 
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transformations is found in a paper by Smith (43) on visual intensity discrimi- 
nation. The problem was one of testing the agreement between experimental 
data on human intensity discrimination and Hecht's theoretical account (26). 

Hecht's equation relating the adapting intensity, I, and the just discrimi: 
nable difference in intensity, AJ, is 


AI c 1 a 10 
x m atas]: us 


nts. Smith proceeded in the following way. On 


where c, a, ke, and K are consta " : 
taking the square root of both sides of [10], the equation may be written 


T. 5+ W Lv al 


or, in its equivalent form, 


(MIN. VISUAL ANGLE) 


LOG DISPLACEMENT 


140 


$0 00 170 


CTION (IN secs.) 
MENT AS A FUNCTION or TIME 


90 —20 a0 60 


TIME AFTER INSPE 


Fig. 2. Common LOGARITHM OF THE MEAN DISPLACE! 


= INSPECTION PERIOD. 
‘ata from Figure 1. 


202 C. G. MUELLER 


where 
C24/—— and C'2 ——. 
a 


Multiplying both sides of the latter equation by the square root of J leads to 
the expression 


VAT = CVT + C’. [1t 


By this development, it is shown that theory demands that a plot of the square 
root of AJ against the square root of J should give a straight line of slope C an 
150 


oo 


50 SEPARATION 415° 


45105 0 s 0 6 2 
Loc (f 


Fic. 3. Loc V/Al-c) PLorrEp acainst Loc «/I. 
In accord with theory this curve has a slope of unity. Fig. 8 in Smith (43). 


exper 


intercept constant C'. Since the range of intensities is large and the t form 


mental points are not equally spaced in the interval, a more conveinen 
of [11] is 


logio (WAT — C") = logio V/T + logio C, 
eof à plot 


when C' is determined from [11] after evaluating C from the asymptot may 
of AI/I against I. (Cis the square root of the asymptote of such a plot an with 
be estimated by visual trial.) Equation [12] represents a linear function ia 
a slope of 1.0. It is the form used by Smith and involves a number of eleme? 
transformations, including subtracting a constant and taking the logari data 
and square roots of numbers. Fig. 3 shows the agreement between Smith's 


and Hecht's theory. 


[4 
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TRANSFORMATIONS TO ÁID DESCRIPTION 


In some cases transformations may be applied for the sole purpose 
of finding an approximate descriptive expression for the relations be- 
tween variables. A problem may, for example, demand a quantitative 
statement of the approximate relationship between two variables for 
Purposes of designing equipment, controlling certain forms of behavior, 
and the like. As opposed to the situation discussed in the preceding sec- 
tion, such a quantitative expression would, in general, have no theoreti- 
cal basis, but would describe the data within the limits set by the con- 
sistency of the data and considerations of simplicity involved in the 


Selection of an expression. 

One method of arriving at an empirical formula for any particular 
Set of data is to apply various transformations to one or both of the 
variables until a set of manipulations is found which yields a straight 
line. The transformations finally selected as successful in this respect 
May provide an empirical statement of the way in which the original 
Variables are related. It is important to note, of course, that these com- 
putational procedures alone imply no theory, and the resulting descrip- 
tive expression in no way constitutes a theory of the data under con- 


Sideration. 

1. An experiment on the relation of area of a visual stimulus 4 the tiec 
threshold provided the data represented in Curve A, Fig. 4, where ie im of 
Intensity of the visual stimulus (at threshold) are plotted against values o 


: d Mote, 20). In 
area, sel h wider range (Graham, Brown, an 1 
oe n to describe these data we may syste- 


Our search for an empirical expressio : E 
matically investigate as simpler and more common —— €: = 
Spect to their success in giving linearly related varia _—— af ndn 
Stance the curve appears hyperbolic, and ARCO hi with an intercept 
Of the variables (area) results in a straight line relations quem implies that 

Value of zero, as shown in Curve B, Fig. 4. This yom o! ] 
© Product of the original variables is a constant, - 

A:I-k, 
: i kisa 
Where A is the area of the stimulus, J is the eem r hehe in for 
COnstant, This formula expresses Ricco's law which ^ 5, epe eril r 
error cave in the fovea and periphery; it has bear EN d Wote, 20; Wald, 
an Bver large ranges of stimulus area (Graham, tion" or a theory of arca- 
i *quation [13] in no way constitutes an i iai "ve it no rational basis; 
My relations; the opérations — Ded aa uppith correction 
in K it may pro / 
facto E restricted range of data it ntly sized areas or adequately serve 
Som. !9r comparing thresholds of differently d Mote have shown that, 
n € other Practical purpose. Graham, Browns 9 tion fits the data over a 
n op onal grounds, an entirely different sort of equa 
k larger r f sti 5 . taken 
ange of stimulus area. ration may be ta 
Another il: of the procedure under conside 
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from the data of Schoenfeld, Antonitis and Bersh (41) which are presented in 
Fig. 5, where the median number of responses (bar-pressing responses in a modi- 
fied Skinner apparatus) in a one hour test period are shown for six successive 
days. The animals had no history of reinforcement in this situation and €: 
not reinforced during test sessions. The data therefore represent the opener 
rate of responding” on successive days. We guess that the data are exponenta 


RECIPROCAL OF AREA 


10 45 20 225 30 35 40 


THRESHOLD INTENSITY (ML. x 1073) 


» 9 zo 30 40 50 60 


STIMULUS AREA (SQ. MIN.) 
L 
Fic. 4. THRESHOLD INTENSITY PLOTTED AGAINST AREA (CURVE A) AND RECIPROCA 
or AREA (Curve B). 
Data from Table II in Graham, Brown, and Mote (20). 


s int 
in form, and so we take the logarithms of the ordinate values as a starting pote 
in seeking an empirical description of the results. Since the curve m 
seems to approach an asymptote which is not zero, a constant (the asyMP y ms 
value) must be subtracted from the number of responses before the Jogar!" of 
are taken. If we take this asymptotic value to be 9.0, subtract it from ^e, 
the median values plotted in Fig. 5 and take the logarithm of the ene 
we obtain the results shown in Fig. 6. The data deviate little mo 1 y an 
which indicates that the original data may be approximately describes tegral 
exponential equation. Obviously the equation has significance 0n. y 


i 
» 


Sy 
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MEDIAN NO. OF RESPONSES 


BAYS Test Days. 
Fic. 5. MEDIAN NUMBER OF RESPONSES IN AN HOUR ON DIFFERENT ES . 
Data from Schoenfeld, Antonitis, and Bersh (41). 


LOG MEDIAN NO. OF RESPONSES 


5 6 


DAYS 


Fte. 6. Log Meptaw Nusmper or RESPONSES IN AN Ho 
Data from Fig. 5. 


UR ASA FUNCTION OF Test DAY. 
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abscissa values. Theoretically, an infinity of curves might also represent the 
data. Possibly an hyperbolic expression would serve as well as the exponentia 
and would be no more complex. 


TRANSFORMATION FOR STATISTICAL TEST 


A third situation in which one may encounter a need for transforma- 
tions is in the case where adjustments are made to permit the use of a 
particular statistical test without violating the assumptions involved in 
the efficient use of the test. This class of problem has been of consider- 
able concern to mathematical statisticians and applications of their 
theoretical work may be found in the scientific literature (Beall, 4; 
Eisenhart, Hastay, and Wallis, 13; Williams, 47). 

Univariate distributions. Consider a simple example of an experiment 
in which we measure the reaction time to the onset of a specified visual 
stimulus under several conditions. Suppose our experiment has yielded 
several sets of data, distributions of number of reaction times of various 
magnitudes. In such an example we are dealing with a univariate dis- 
tribution, i.e., a frequency distribution with respect to a single variable. 
Since experiments on reaction time (e. g., Jenkins, 29) have shown that, 
under many conditions, the frequency distributions of reaction time are 
not normal but positively skewed, problems may develop in the appli- 
cation of many statistical tests. If marked skewness exists, à direct 
application of "normal curve statistics" would undoubtedly lead t? 
error because of the lack of correspondence between our data and the 
conditions imposed by the test.* However, several procedures for avoid- 
ing this error are open to us: we may (1) use a statistical test applicable 
to our type of distribution if such a test exists, (2) use a test whic! 
makes no assumptions about the population from which our sample 8 
drawn, or (3) transform our scores into a normal distribution SO tha 
we may use one of the many tests available for this distribution. d 
with the latter procedure that we are here concerned.® 


* The degree of error, of course, is the important thing in any specific situation- If E 
error is not large the experimentalist will be willing to ignore it in the interest ° " 
plicity of calculation. There is some evidence [see Smith and Duncan (42)a 
(7) for discussion and references] that minor deviations from normality in 
samples do not seriously impair the sensitivity of certain statistical tests, 
stricted generalization of belief in this state of affairs to all cases of non-norm 


i í : B z E ; 1 sam 
pecially in small samples, is certainly to be avoided. Evidence is clear that, in gme don 
ples, serious error is introduced by ignoring the form of the population distri dis- 

i (8) for 


[Rider (39), Festinger (16); again see Smith and Duncan (42) and Cochran 
cussion and references}. 

5 ]t is not the purpose of this paper to discuss all of the methods 
from populations having non-normal or "unassumable" characteristics, bu 


ods of handling dat? 
t some or! 


i 
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The general problem that confronts us is one of transforming one 
Set of scores having some specified frequency distribution, y —f(x), into 
another set having a distribution, y= F(z). It is required that we find 
some function s=g(x) such that when transformations are made from 
* to z, z is distributed in a manner specified by y — F(z). 

Although conversions of a set of scores into any specified frequency 
function is possible, the most frequently encountered problem is one of 
making the data approximate the normal curve. The interest in having 
the data approximate the normal curve arises mainly from the greater 
amount of information available on the sampling characteristics of the 
parameters of this function; if the data can be put into the appropriate 
form, larger resources exist for testing statistical hypotheses. 

At least two possibilities are open to us in our search for an appro- 
Priate transformation. The first presents us with a solution of the general 
problem to an approximation which can be made as exact as we please. 
Several solutions of this type are available and involve an approxima- 
tion to the desired function by using a power polynomial or series func- 
tion [cf. Baker (1), Fry (19), Kendall (30)]. These solutions are general 
and may be used on any continuous distribution. The second possibility 
available to us is to see whether one of several convenient transforma- 
tions, such as taking the logarithms or reciprocals of the scores, gives 
an appropriate and sufficiently precise solution. Logarithms or recipro- 
Cals provide an exact solution in those cases in which z-g(x) is a loga- 
rithmic or reciprocal function; practically they have been found to 


entation with respect to the use of transformations may be gained if some of the other 
methods of handling data are mentioned. Although the emphasis on sampling soa 
tions of parameters of non-normal populations has not approached that ee o pu 
Curve functions, work on non-normal distributions [e.g., Festinger aw., M — 
May eventually decrease the advantages of recoding scores to test xin ica ipu: 
Y providing analogous tests for non-normal functions. In the end anms ol — 
istribution of reaction time scores we might test for the differences Siam : E d 
two such distributions by using a test suggested by Festinger (16). Unfortunately suc 
techni a H ] 
fersat do stitial test which makes no assumptions about the population 


distributi i n increasingly important procedure to research work- 
paren probably peronieis rmation present in our scores by dealing 


ers. If we are willi ifi e of the info: 
k willing to sacrifice som : X 
with ranks or ci of scores, statistical procedures are available for testing many 


i ion i o [Hotelling and Pabst (28); Pitman 
yPotheses, e.g., that population correlation is zer [ E oH wad Wiitosfis AS), 


(37)] and f the same popul: 
d that two samples came from : 2 
!tman (36); Mann and Whitney (32); Festinger (17); Dixon leg - endis ei 
Since the latter di Il the available information; 
o not use all the avari ker. 
“sts of the latter type presents a definite advantage to the research wo: 
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provide sufficiently accurate approximations in a large number of cases 
encountered experimentally. 

If we were interested in approximately “normalizing” our reaction 
time data, some convenient transformation may serve as a first step. 
Jenkins (29) has shown that the logarithms of reaction time scores are 
approximately normally distributed, and this transformation may be 
our first choice. If we had two distributions on a single dimension 
(such as two sets of reaction time scores) and were interested in signifi- 
cance of the differences between means, such a transformation would be 
an appropriate step preliminary to the application of a t-test or some 
similar test of significance. 

Bivariate distributions. In the field of statistical analysis one fre- 
quently encounters the variate transformation in situations which in- 
volve two variables rather than one. 

Transformations, as applied to bivariate distributions, become an 
important tool of analysis in two cases: (a) in the fitting of theoretic 
or empirical curves to data, and (b) in the application of tests of sig- 
nificance involving an "analysis of variance." Most applications 0 
"analysis of variance" techniques and most fitting of curves by the 
method of "least squares" involve the assumptions of normality and 
of homogeneity of variance in the various columns.’ If the heterogeneity 
of variance exists and results from a correlation between mean values 
and standard deviation values, it is frequently possible to find a simp’? 
transformation by which variability may be equalized. Examp es 0. 
useful transformations in the application of ‘‘analysis of variance 
tests have been extensively discussed by Bartlett (2, 3), Cochran i 
Curtiss (12), and others. In the present account we consider a few © 
amples taken from the field of psychology. 


1. Many groups of data in psychology show a variability which is appro" 
mately proportional to the mean. For example, the measurement of differen ity 
thresholds in sensory psychology provides data demonstrating a proportion® 5 
between the mean and standard deviation over a considerable rang? ° tain 
controlling variable, intensity. If we attempt to analyze the effect of rds 
variables on mean differential threshold by using "analysis of variance pens 
niques we require some method for equating the variances in the severa ie 
In such a situation it may be shown that the variability is roughly equa 

rithmic transformation. . : Ho 
: gom an example in a visual intensity discrimination experi us 
finding a transformation such that the variability of AJ is constan i; approx” 
values of AJ. Crozier (10) has shown that the standard deviation © 


imately proportional to AJ, that is, E E 
in ^ : ic ty 

6 Exceptions occur in “anal oi parametr - jr 

man (38)] and in "least squares” f 


below. 


lysis of variance” tests of then 
solutions utilizing a weighting term, 
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car = RAI, [14] 


wee kis a constant. We require a transformation to y, where y =f(AJ), such 
that the standard deviation of y is constant for all mean values of y. The vari- 
ance of a function of a variable can be represented approximately’ (30) by the 


equation 
os? = os! (ds/dx)*, [15] 
where x is the variable and s=f(x). From [14] and [15] it follows that 
oy = kAHdy/A(AD], 16 
Where y 2f(AI). If we let oy equal a constant, a, and transpose, we obtain 
dy/d(AI) = a/kal. 17] 


The function, y 2f(AI), which has the latter derivative is (a/k) log, AZ; and so 
logarithms of AI are the appropriate numbers to use in this type of experiment 
when homogeneity of variance is presupposed. 

2. Sensory psychology is not the only area from which examples may be 
drawn; many studies in conditioning and learning, for example, present similar 
problems. Asan example taken from the field of conditioning we may examine 
the data of Zeaman (48) on the changes in latency of a running response during 
acquisition. Zeaman used a straight runway of the Graham-Gagné type and 
included in his measures the time interval between the opening of the door to 
the runway and the entry of the animal onto the runway. An examination 
of the raw data shows (1) that the distribution of latency scores for a group of 
animals on any particular trial is markedly skewed and (2) that the variability 
of each distribution varies with the mean value. 

Assume that we are interested in arriving at a “best fitting" line to repre- 
sent the data or in testing the hypothesis that there are no significant changes in 
average latency from trial to trial. Most methods of satisfying these interests 
require that the distribution of scores for each trial be normal and that the 
Variability be constant from trial to trial. This is not the case with the latency 
SCOres, so we seek some transformation that will yield scores satisfying these 
Conditions. We hope that the transformation that stabilizes the variability 
in the various trials also normalizes the distributions for each trial. In view of 
the success of the logarithmic transformation in approximately “normalizing 
Some reaction time data, we may begin our analysis by taking the logarithms of 

€ latency measures. To test the success of this device in performing the de- 


. 7 Equation [10] and the development that follows involve several severe approxima- 
tions; they are presented for their heuristic value to those not familiar with the argu- 
re For some difficulties with this development and a more rigorous account of the 

ansformation discussed here (and several others) see Curtiss (11). 


In additi C itisi tant to note that the operations 
ition to th imations involved it is importa 1o 
gie , even though they purport to stabilize 
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25 Poisson distribution, logarithmico-normal distribution, eter the transformation which 
i ilizes variability will also normalize the data. Forms of variance heterog' 
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Pes however, in which this would not be the Be ? 
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above specified log 


sired function let us cumulate the percentages of cases falling H 
paper with log 


latency values and plot this cumulated percentage on probability 
latency on the abscissa. The cumulative plot of a normal distribution shou 

givea straight line whose slope is proportional to the standard deviation. Sucha 
plot of Zeaman's data is shown in Fig. 7. The data for the odd numbered trials 
are shown. The approximate normality of the distributions of scores 
trial is obvious from the fact that the points may be approximate 
by straight lines. The homogeneity of variance is demonstrated b 


for each 


ly represente 
y the fact that 


99 


CUMULATED PERCENT OF CASES 


23 21 9 7 L5 [E] ia 3 1 


LOG LATENCY (IN SECS.) 

Fic. 7. ZEAMAN’S DATA (48) ox Opp NUMBERED TRIALS (i T 
LATED PER CENT OF Cases (PER CENT OF CASES FALLING ABOVE TH 
PLorrED ON THE ORDINATE AND THE COMMON LOGARITHM OF TP 


ONDS, ON THE ABSCISSA. 
The lines, from left to right, represent trials 1, 3, 5, 


o 11), WITH heu 
E ABSCISSA L 
ig LATENCY, IN 


SEC 


7,9, and 11, respectively: 


the lines through the points for various trials are nearly parallel. Th SP onc 
ment of the lines along the abscissa shows the changes in the average o! pn a 
during the course of acquisition. This simple transformation has Pis thats 
function in Y (log latency) and X (trials) which has the characteris : ciable 
for changes in X, we obtain changes in the mean value of Y but no Y values 
changes in the variability of. Y or in the form of the distribution 9,1 on th 
This result confirms a much earlier report by Graham an Gagné, nar situa" 
advantages of using the logarithms of the latency measures 17 a 


tion. 


i re, 
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phasized in the statistical 
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of scores is normally distributed along some dimension, then scores along any 
non-linear function of this dimension will not be normally distributed. In the 
present instance the fact that the logarithms of the latency scores are distrib- 
uted normally means that the latency scores themselves are not normally dis- 
tributed. A plot, on probability paper, of the cumulative frequency of the la- 
tency scores against the magnitude of the latency shows a markedly curved line 
which shows deviation from normality, and the degree of curvature varies with 
mean value or position on the probability grid. The existence of such informa- 
tion, of course, argues against the application of normal curve statistics to the 
raw scores. À . 

3. The examples considered so far by no means exhaust the areas in which 
transformations are, or should be, an important step in analyzing experimental 
data. Space permits only brief mention of some other examples. An important 
paper by Morgan (33) shows the advantage to be gained by transforming data 
Obtained in experiments on food hoarding in rats. Morgan shows that the 
logarithms of the number of pellets horded are distributed more normally 
than are the numbers themselves and that variance under the various con- 
ditions is more nearly equal with transformed scores. . 

Haggard (22) eid nd and Garner Q3) have considered the ler 
of the transformation of experimental data in some detail in a discussion o 
the analysis of the results on the galvanic skin response. Haggard and ues 
show that the variability of the GSR measures 1s approximately propor ara 
to the mean value of the GSR, a consideration which, as we viet sum 
leads to a logarithmic transformation if we seek scores showing = en: a i Y 
In order to scale the GSR measures, Haggard and Garner employ an additiona 
computation which need not concern us here.? 

? Exactly how the transformation finally selected equalizes pucri Go zaaien o 
equalizing the response magnitude to a constant stimulus) is not clear. Ls aiam = 
Garner present data on (1) the magnitude of the galvanic skin nso hé edie: 
the resting resistance and (2) the variability of the GSR. The data an : ri ium 
of the response as a function of resting resistance 1S y an eq 
orm 

AR = ad, [al 
he GSR is measured; R, the level of resistance 
onstants; and e, the base of Napierian loga- 


represented b 


ite AR is the change in resistance when t! 

Just before the response occurs; 4 and 5, c 

rithms, From [a] it follows that i 
loge AR = log, a+ oR 


and that 
loge AR — loget _ b t 
108. aM = b. 


R 
Haggard and Garner 


by the statement that the standard 
ly proportional to the mean value. 


i ed. 
left side of equation [c] indicates the transformation us 


Su. re ic 
™marize the data on the variability of the response 


e i H e 
Mp of the response magnitudes is approxima te ba ae =e aR valies 
i nc R 
que Lees eom e SES divide these log values bya variable 
wi 


Sho; f 
uld therefore show equal variability- S paiid E om nions 
> R, as dictated by equation [c], variance heterogeneity 


tm, R 
asgard and Garner indicate that this is not the case- 
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SoME THEORETICAL CONSIDERATIONS 


Transformations required by theory. The selection of the appropriate 
transformation to “normalize” the data or to equalize the variability 
in the cases discussed so far has been on the basis of empirical considera- 
tions; we have been interested in satisfying certain conditions, aS, for 
example, the requirement that the transformed scores be normally dis- 
tributed and/or show homoscedasticity. On this basis we sought trans" 
formations which would serve our purpose. Obviously, there are many 
transformations which will satisfy any specified condition to any eve 
of approximation we may designate, and our selection from among these 
possibilities has been on the basis of simplicity, ease of calculation, a? 
degree of approximation desired. In certain instances, however, other 
considerations may enter to determine the form in which the data may 
be most fruitfully analyzed, and it is here that we encounter the role o 
scientific theory in restricting the range of selection of the appropri? e 
dimension for analysis. 

Earlier it has been shown that, where there is evidence that the mag” 
nitude of variability is roughly proportional to the mean value, We ma 
approximate homogeneity of variance by making a logarithmic trans" 
formation of the data. Frequently we are led to make such à transfor 
mation on the more indirect basis of a successful theoretical formul 
in the field with which our experiment is concerned, even though € 
data may not contain conclusive evidence that variance heterogene! y 
would occur without such a transformation. 

1. Consider the case in which changes in some dependent variable 9f 
function not only of the changes in the independent variable but also some '. .. 
tion of the magnitude of the dependent variable. A model for such 4 re atch 
ship may be found in many photochemical systems, for example, those in Y put 
the amount of decomposition is a function not only of the intensity of ^! pr 
also the amount of photosensitive material available. 

A general form of the kind of theoretical statement frequent! 


in photochemistry is " 


— dy/dt = kIfO) A 
pt intensity ki 


y encounter? 


where y is the amount of photosensitive material, I is the light 
a constant, and f(y) is usually y or 9”. Rearrangement of [18] yields 


— kIdi = dy/f), 
from which it follows that g 


— m= f anro» 


"E per of ^; 
Tt may be shown (Cramer, 9) that, if variations 1n the left ane them in 
own vy activities of independent random varia 
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the limi i i 
it, the integral of the right member can be represented in the same way 
(y) is equal to y, that A 


If w z 
E bey Ps an example, the simple case in which fl 
Ves Phan Pres proportional to y then the right side of equation [19] 
Sine, a ea? a He y. Thus, if variations in It follow a normal Gaussian func- 
eA aui ual ind that the logarithms of y values would be similarly distrib- 
2 pui ue involving y must consider this possibility. . 
E rc [es pud does not provide many examples of transformations 
C able from theoretical considerations, some cases may exist. Let 
sider the food hoarding data discussed by Morgan (33). It will be re- 


called : : 
his i Morgan found a logarithmic transformation to be useful in treating 
a. Assume that, i.e. one representing 


for any experimental conditi 
Cons ivi B : : Y M. pix! 
e Ln drive level, illumination conditions, amount of infant feeding depriva- 
"n m there exists a fixed rate of occurrence of the hoarding response an 
obs responses under any fixed experimental conditions are distributed 
(P,) ^ y in time. For events distributed randomly in time, the probability 
n) of obtaining n events ina time interval, #, is 

(rt)ne7* 

Pam GUN 120] 
of the events and e is the base of Napierian 


l . 
ogarithms. Thus, the number of events observed in intervals of fixed length 


experi h 3 wd 5 

com test interval) will be distributed according to the Poisson law. 
V characteristic of the Poisson distribution is that the mean is equal to the 
riance, which means that any ges the mean thereby 


cha condition which chan 
nges the variability. By reasoni d in finding the trans- 


ies ng similar to that use! 
ds -€— for the case where the standard deviation is proportional to the mean, 
find that the transformation 


cl a separ pol nsforma that approximately equalizes the variability 

Curtis of Poisson distributions is the square root transformation. [Again see 

porn (11) for a rigorous proof] Thus, from the assumption that the re- 
s are randomly distributed in 


wer time for any experimental test condition, 
in Aimed employ the square roots of the number of hoarding responses as data 
ituations where homogeneity of variance is pr! 


ef esupposed. The author knows 
ton. test of this notion, but presents it as an example of the kind of considera- 
n which may enter. 


w ; 
here r is the rate of occurrence 


of transformations. Although the 
to stabilize the variability of one 
iable, X, may be found ir 
ds of statistical tests ma} 
reason for seeking homo: 
bivariate frequency func: 


T" oe of “laws” by the use 
can e a most deliberate attempts 
the a x Y, for various values of another Var 
not e ne ah of statistical tests, the deman 
geneit the only, or even the most important, 
tion y of variability. One characteristic of a biva : 

With normal distributions of equal variability 1n the various column: 


as s 
not been emphasized in the li When the values of one vari 
istributed with equal vari 


abl 
hia Y, are such that they are normally d n 
a for all values of X, the fun : o the two variables, ] 
X, does not change in form U ion of the measure © 
tative value employe 


Positj 
ion, i 
n, i.e., under a change of the represen 
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(mean, median, mode, centiles, etc.). Although there are statistical 
bases for choosing among the many available measures of position, 
these bases may not appear compelling to the theorist if other consider- 
ations of importance enter, such as the observation that different func- 
tional relations between variables are obtained depending on the meas- 
ure used. For example, the mean may be rationalized as the most ap- 
propriate measure of position for the normal distribution because it is 
the most efficient statistic, i.e., no other statistic has a smaller sampling 
variability. The differences in efficiency, as for example between the 
mean and median, however, are hardly of a magnitude to dictate the 
particular form of analysis if complexities of theory result. When the 
sets of Y values for various X values yield normal distributions of equal 
variability no problems of the multiplicity of functional relations arise; 
since the mean, median, and mode have the same value and the selec- 
tion of any centile or other fixed position in the distribution merely 
results in a displacement of the function on the ordinate, or Y, scale 
by some specifiable amount. The correlation between the mean and 
standard deviation in cases of skewed distributions, however, guarantees 
heterogeneity and greatly increases the likelihood of actually finding 4 
multiplicity of functional relations upon changing the positional meas 
ure. Thus, if our original data are skewed we could show, for a single 
experimental condition, as many "empirical behavioral laws” as there 
are measures of position for a distribution. : 

It seems highly unlikely that theoretical psychologists will find it 
fruitful to attempt to formulate the infinite set of theoretical statements 
necessary to handle all the “laws” that would result from any changes 
in the measure of position or central tendency. In the case of skewe 
heteroscedastic distributionsit would seem much more feasible to provide 
a rational account of the transformation required to yield a homogeneous 
set of normal distributions. 


, " " 1 

An example of the difficulties encountered when dealing with non-norm* 

distributions is available in the study by Felsinger, Gladstone, YamagUn. 
and Hull (15) on reaction latency. Curve A in Fig. 8 shows the cumulated 


: : de 

tribution of reaction latencies presented by these authors plotted on pue 
cel 

paper. rva- 


ture) and confirms the reports of other investigators (Graham and Gaps that 


Zeaman, 48). In the light of the skewness of the data, it is not surprisin 
the authors find different functional relations between reaction | 
number of reinforcements if the mean or the median of the distrib aa 
es is taken as the representative value. As in the case of €— an 
ta in Fig. 7, we find that the logarithms of the time measur Bin 
lly distributed as shown by the linearity of Curv 


ution of I2 


tency scor! 
latency da 
approximately norma 


OA 


i 
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Fig. 8, whi 

1g. 8, which re 

with presents a cumulativ 

log latency values on the S S plot on probability paper 
LATENCY (IN SECS.) 


475 575 675 7.75 875 


75 i75 275 375 


CUM 
ULATED PERCENT OF CASES 


LOG LATENCY (IN SECS.) 


Fig. 

8. G 

SCIssA Wives ese fa Per Cent or Cases (PER CENT OF Cases FALLING BELOW AB- 

ATENCY ils E Funcrion oF THE LATENCY (Curve A) AND THE LOGARITHM OF THE 
ve B) 

ata fri Fils 

om Felsinger, Gladstone, Yamaguchi, and Hull (15). 

thmico-normal distribution, 

f statistical efficiency, since i 


MUI 
mould ee dealing with a logari the arithmetic mean 
etric mean s its usual advantage 0 n this case the geo- 

shows a smaller sampling variability. 

by the use of logarithms, it is im- 
under any circumstances, be 
nacea for the experi- 
does have wide ap- 
y situation must 


Ince w 

is to esr er apr to solve several problems 
men a as beim that the present discussion must not, U 
lie pieta Analytical that the logarithmic transformation is à pa 
be m ility, but ob ical problems. The logarithmic transformation 
© on the b; viously the selection of a given transformation in an 

9th n many of asis of merit. 
A imple hike areas from which we have drawn examples, 
? closer is ormations may satisfy our requirements or that a 
ang action tim, pproximations to normality. For example, there is son i 
ata iris scores retain some of the positive skewness of the original disi 

og transformation gives a better approximation to normality. 

dition, deviations 


als, 
m5 
e 
ue wh 
ert 
€ latency measures are employed. In ad 


there is evidence that 
dditional conversions 
is some evidence that 
tributions 
This may 
of the sort 
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SowE CONSEQUENCES OF THE USE OF TRANSFORMATIONS 


Frequently we find that the selection of a transformation to simplify 
the form of a theoretical statement may preclude the possibility of 
normalizing the data or equalizing the variability. Thus, transforming 
our data for theoretical reasons may complicate the process of meeting 
the assumptions of statistical procedures. 

Curve fitting. Consider the case of fitting an empirical or theoretical 
curve to the data of Fig. 1. Assume that we wish to fit the curve 
D=Dve-* to the data in Fig. 1, where £ represents the time since the 
end of the inspection period and D represents the displacement of the 
setting. We have seen earlier (equations [8] and [9]) that one method 
of testing the applicability of such a function and, at the same time, 
providing an evaluation of its constants is to plot logio D against ^ 
Such a plot should give a straight line with a slope of — 4343 k and an 
intercept constant of logwDo. If we are satisfied with an approximate 
indication of the relationship between the variables and a subjective 
estimate of the degree of fit, fitting the line by visual inspection satisfies 
our needs. For the purposes of the present discussion, however, let u$ 
consider a more "exact" method of evaluating the constants involve 
and note some of the problems that arise from such a consideration. 

Our first problem arises in the selection of a statistical solution of the “best 
fitting" line. Several techniques for obtaining this “best fit" are available, but 
we shall arbitrarily select one for discussion, the method of least squares. 
method serves as à good example and has the advantage that it is widely em 
ployed and has been extensively developed. The method is characterize 


the assumption that the “best fitting" line is that line about which the weight? 
squared deviations of data from the line give a minimum sum. 


shown in the first trial in Fig. 7 may prove to be replicable. For the special cases of a 
si 
u 


logarithmico-normal distribution where the lower limit of the scores is not zero the ie 
traction of a constant (which is the lower limit of the scores) from each score before nis 
logarithms of the numbers are computed may be necessary to insure normality- T e 
consideration arises from an inspection of the parameters of the generalized form of log 
rithmico-normal frequency function, which is (Cramer, 9), 


(dl 


= e loloa EL 
c(x — a)v/2v 

where ø is the standard deviation, and m, the mean of the log distribution; *» a sop 
a, a constant; and e, the base of Napierian logarithms. The adequacy appro? 
logarithmic transformation in any of these cases will depend upon the degree 9 
mation required as well as the particular characteristics of the data. 

In addition to this matter of emphasis, it is to be noted that many use 3 
tions, for example the arcsin transformation (Zubin, 49; Eisenhart, Hastay, a 
13), have not even been mentioned because of limitations of space- 


form? 
ful trans nilis 


— P À — 
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propriate weighting of points is one of the impor- 
pom atio: fitting, although itis one which arises only implicitly 

n most applications of the least squares method. We know that the combina- 
tion of several distributions of raw scores into a single distribution effectively 
weights the separate distributions in proportion to their variability; that is 
although we ostensibly weight each distribution equally by merely adding the 
scores to obtain the final distribution, implicit weighting arises from differences 
in variability in our original distributions. We shall see that this factor plays 
an important role in the solution of best fitting lines in the cases of original and 
transformed data, since the usual normal equations 

aN + bx = Zy 
ax + bEx? = Ery 

that is, they assume homoscedas- 
long the fitted line. The use of such equations in 
Ids a line around which the sum 
of the squares of the logarit a minimum. If the variability 
of Y is equal on a logarithmic scale, i.e., if log Y values show equal variability 
for all values of X, each point on weighted. But if the Y 
values themselves show equal vari bility for various values of X, using equa- 
tion [21] differentially weights ea of its Y value. Thus the 
solutions of a and b obtained by fitting a straig ine t he transformed data 
by the least squares formula [21] differs from the solutions obtained by fitting 
a curve directly to the raw data on the basis of e In 
the present case this means that the numerical values of Do and k obtained from 


the data of Fig. 1 depend upon xponential curve: j 
data, or on the basis of astraight-line plotof logo D against $ using equation [21]. 


It seems clear that the choice between the constants obtained in the one 
case by fitting a straight | formed data by equation [21] and in 


the other case by direct application of least squares to the exponential form de- 
Pends upon theoretical an iderations; these considerations 


must be specifiable if the solution of the constants in the function is to have 


Precise signi ithout à “weighting” term 
; gnificance. If the least squ for h 
I5 applicable to the untransformed scores, it is not applicable to the transformed 


it an wn i f the sort j 
seems i i consi s 0 3 
Sly Led. p for whom the exact numerical value of constants 
E important. Variability factors 29 well as normality factors are too important 
9r the solution of these parameter values to expect that we can efficiently esti- 
nd give any interpretable statement about confi- 

and related topics. 


g specific attention to these e 
dure is increased if we employ 


each point. The required for- 


The problem of the ap 
tant considerations in curve 


Q1] 


ust outlined be seri- 


bsec generality of our “Jea 

lae 2. which incorporate a term fo 
are given by the equations 

agw + bXws = uy 2 

aEwx + paws? = zwxy 

ocal of the variance. 


ed as the recipr 


d is defin In addition to the usui 


e, P Where 
i 7 V represents the weight an 4 Q2] aem 
are 0 - 


Sais aoe 
imilarities of formulae [21] an 
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computations, however, equation [22] requires that we have a method for de- 
termining the variability of transformed scores when we are given the vari- 
ability of the original scores. 

We have seen that an approximate solution for the variance of a transformed 
distribution in terms of the variance of the original distribution is given by 
equation [15] 

os = 0:%(dz/dx)?, 
where z is the transformed score; x, the original score; and o?, the variance of 3 
or x scores. Consider a logarithmic transformation. If the variability is small 


compared with the mean, then the log of the mean of x is approximately equal 
to the mean of log x, in which case 


a = (1/2)%0,7. [23] 


Equation [23] allows us not only to determine weighting terms but it also 
allows us to evaluate the error in curve fitting introduced if account is not taken 
of implicit changes in weighting. 


The combination of the weighted least squares formula and the gen- 
eral solution for a set of transformed scores greatly increases the 
generality of the method of curve fitting and permits other than statis 
tical considerations to dictate when and what transformations shall be 
made. In particular, convenience or simplicity of the form of our theo- 
retical statements may now play a larger part in determining the analysis 
of agreement between data and theory. 

Correlation. Discussion of the least squares method of curve fitting 
leads to another example of the bivariate distribution in which similar 
assumptions are made and similar consequences figure. The example 
is the product-moment correlation. Although a formula for the correla- 
tion coefficient may be developed without assuming a normal bivariate 
distribution, it must be emphasized that such a proof is not a sufficient 
condition for the extension to all cases of the large superstructure o 
correlation theory which is associated with the correlation term in the 
normal bivariate frequency distribution, particularly that portion ° 
correlation theory having to do with sampling, significance tests, etc- 
It is true that one may compute a correlation coefficient for any group 
of paired numbers, but the interpretation to be placed on the zen E 
this computation is not always clear. Statements about the samp d 
characteristics of the correlation coefficient, inferences relating to pu 
lation parameters and differences between parameters, are, for the m 
part, limited to normal bivariate distributions. 

Our discussion points to a few factors which may be ex 

limit the application of correlation techniques to non-norma 
obvious changes are introduced in the normal biva 


pected to 
1 popula 
T riate dis” 
tions. Two 


FQ ——— a r a 


" de — 
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ui Lo non-linear transformations are made in one of the 
eiii E : he first of these is a change in the regression line; if the raw 
ili; a inearly related, then the transformation of one of the vari- 
ie ae this relationship. The magnitude of this change will 
Uodnesd ¢ epend on the transformation made. The second change in- 
abilit a also of a type mentioned earlier, that of a change in vari- 
hn of the distributions of the transformed variable. Both of these 

rs guarantee that at least one of the forms of bivariate distribution 
n one involving transformed scores or the one with the raw scores) 
b ot normal, and different values for the correlation coefficient will 

€ obtained by performing similar computations on the two forms. 


m The importance of the form of the bivariate frequency distribution for 
Bue meni correlation between variables is considered in papers by 
regr iel (14) and Stephan (44). In discussing the importance of linearity of 
| ession in the product-moment correlation, Ezekiel points out that the 
ogarithmic transformation of one variable frequently converts a curvilinear 
regression to approximate linearity and that product-moment correlations are 
increased in such cases. In one example, however, Ezekiel showed that such a 
transformation not only did not increase, but actually decreased, the correla- 
tion. The example was one in which the per cent protein in grain was correlated 
With the per cent vitreous kernels. As pointed out by Stephan the decrease in 
°rrelation is not surprising since the logarithmic transformation was not appro- 
oe in this case and merely served to increase the curvature of the line relat- 
on the variables. Stephan showed, however, that by transforming one dimen- 
the E per cent non-vitreous kernels (rather than per cent vitreous) and taking 
the logarithms of these values, the regression became approximately in aug 
reg Variance around the straight line fitted by least cong hint a y 
tio Uced. The correlation between the original variables was .73 and the a a- 
n after transformation was —-98- Thus, with the same experimenta ata 
an | Without changing the rank order of the scores, the presumed "amount of 
“ance accounted for” was almost dae X. ion of experimental 
be Another example of a case in which the interpretation 0 a tion 
d would have been markedly different if nto PPop amond (84) ds 
9t been employed is an experiment by Mueller a ` € J 
S 6 n ipee * f measuring visual acuity. 
he pidan d reliabilities of s «p mi) cetesenter for each subject, 
Were th na data, numbers recorde y lassify each size of figure in com- 
merci © denominators of the fraction used to classily ch iP fons ie a constant 
an tal tests of visual acuity. The numerator for all such fra 


typically 20, The fraction has the dimensions of a visual ee has 
; oretical status as a variable in vision. ese con- 


:etributi e fractional scores for the 
hie distributions P positively skewed) and 
Oa ia de linear, left little doubt as to 
y analysis. When the fractional 

orrelations (product-moment), 
An interest in the importance 


Consi 
side erable empirical and the 


t er; 
several ons and the facts that t 
the re tests were approximately n 
the . Sessions between the tests approxima 
Scor ‘PPropriate dimensions for the reliabilit 
the fey ee used in computing the test-retest C! 
were as given in Table I, column 4 
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TABLE I 
Test-Retest RELIABILITIES FOR Six TESTS OF AcuITY. 
Column (a) shows the results when the acuity scores are used; column (b) shown the 
results when the reciprocals of the acuity scores are used. 
Reliability 
Test á b (reciprocal of 
a (acuity scores) acuity scores) 
A .81 .81 
B .75 -79 
c .69 .86 
D .68 .76 
E .61 .81 
F .56 .65 
of the form of the bivariate distribution, however, prompted an analysis of Re 
test-retest correlations using the originally recorded scores, rather than , 


s are presente 


fraction scores. The correlations in the case of the original score : A 
(using Fisher $ 


in Table I, column b. Tests for the significance of the differences 

z transformation) in Table I, column a, leads to the conclusion that test A has 
a higher reliability than tests C and D, at between the 1 per cent and 5 per cent 
levels, and higher than tests E and F at less than the 1 per cent level. The 
same significance tests applied to the data of Table I, column b, lead to the 
conclusion that test C has a higher reliability than D at between the 1 per 1 
and 5 per cent levels and higher than F at better than the 1 per cent er 
Obviously, such a result constitutes a dilemma only if we ignore the importan". 
of the frequency characteristics of the data to be analyzed. Actually, bs be 
reasons to doubt the applicability of our test of significance in the case © 
untransformed scores. The point to be emphasized is that test- 
tions on the recorded data would have led to conclusions incompa 


results of a more appropriate analysis. 


cent 


retest CO 
tible with ^ 


SUMMARY 


This paper considers several circumstances which may call for the ae 
merical transformation of data: (1) testing theory, (2) finding empi"! o 
descriptive expressions, and (3) satisfying conditions of applicability : 
certain statistical techniques. The application of transformatioP 
each of these circumstances may be straightforward; nevertheless. d 
count must always be taken of the degree of approximation single 
or permissible. In addition, situations involving more than em <form 
case of (1), (2) or (3) will frequently call for incompati le e» 

mations. Such a case might arise, for example, in testing à formed 
cal prediction of linearity of relationship between two tra ity 
variables; here it might be found that non-normality 2 


f 
f 
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jello around the theoretical line are introduced by the trans- 
ing operations. Such a state of affairs may make certain statistical 
tests inapplicable. Approximate methods are mentioned by which some 
= of this sort may be solved. When sufficient information is 
ree these methods may be of value in determining the magnitudes 
of desired constants. Little is gained by employing these statistical 
methods of curve fitting if adequate information is not present. 

For the theoretical psychologist many important considerations, 
other than those concerned with statistical tests, attach to the varigus 
kinds of distributions of two variables. Certain forms in which data 
may be expressed may lead us to as many “Jaws” as there are measures 
of “central value” of groups of data. In such cases appropriate trans- 
formations serve to reduce the multiplicity of forms of functional relation 
between variables, and these transformations provide both a subject 


matter and a tool for the theorist. 
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HISTORICAL BEGINNINGS OF CHILD PSYCHOLOGY 


WAYNE DENNIS 
University of Pittsburgh 


Scientific child psychology is commonly said to have begun with the 
publication of Preyer's Die Seele des Kindes (36) in 1882! and Hall's 
The Contents of Children's Minds (20) in 1883. Both of these were 1M- 
portant contributions. Preyer's general account of mental develop- 
ment in the first four years was based upon careful and detailed observa- 
tions of his son, to which were added data contributed by others an 
some comparative facts concerning behavioral development in animals. 
Hall's publication represented a very extensive attempt to determine 
the child's familiarity with a large number of relatively ordinary objects 
and concepts upon entrance into the first grade. This study of children $ 
concepts was oriented, on the one hand, toward a description of the child s 
thought tendencies, and, on the other hand, toward the educational im- 
plications of the limited experience and understanding of the child upon 
his entrance into school. 

Without doubt these two works are important landma 
development of child psychology. However, the history of science warns 
against attributing absolute originality or uniqueness to any contribu- 
tion. Every development must be the result of a developmental process: 
The trends which led Preyer and Hall to observe and record child be 
havior attracted other men as well, and Preyer and Hall were not the 
first to carry scientific enterprise into the realm of human infancY- | ., 

Both of these men were well aware of the contributions of bw 
predecessors. Preyer's book contains many references to the records a 
earlier observers, although, to be sure, none had left records as comple 
as was his own. Hall’s study was undertaken as a repetition © um 
earlier study conducted in Berlin by Bartholomái (3). Hall, like Le 
did a much better job than his predecessors. The works of Preyer ut 
Hall were outstanding not because they were novel in conception: 
because they were executed with great ability and thoroughnes* y. of 

It is the aim of this paper to survey the observational $ ^ 


r to b! 

1 The date of publication of this book is sometimes given as 1881. This p ate 
due to the fact that Preyer, in prefaces to editions beyond the first, gave pe 
of the first edition, and later writers have accepted the author's statemen ition j bt 
preface to the first edition is dated October 6, 1881, but a copy of the first eC” te 0 
Library of the Surgeon General’s Office, Washington, D. C., bears a pu 
1882 and bibliographical sources of that period give 1882 a 
fore, that Preyer, in writing later prefaces, erroneously 
face as the date of publication of the first edition. 


rks in the 


scems likely, there 
date of the first pre! 
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normal child psychology up to the time of the publication of Die Seele 
des Kindes and The Contents of Children's Minds. It is believed that a 
Survey of this sort will indicate the proper prominence of the two stud- 
les which are usually referred to as the first major studies in child psy- 
chology but will, at the same time, show that the early development of 
child psychology was continuous and gradual. 

Before beginning this survey, it will be advisable to attempt to define 
its scope. It undertakes to include only observational records of normal 
child behavior; it excludes theoretical treatments of the child. Since 
nearly every theory of human nature involves some consideration of 
childhood, to include general and theoretical material on the nature of 
the child would materially change the task which we have set for our- 
selves, A justification for excluding this material is found in the fact 
that theories of child nature have been given adequate historical treat- 
ment in several histories of philosophy and in histories of education, 
While no adequate survey of the early observational studies of children 
1$ available. : 

We pes excluded also all publications relating to the rd ef 
abnormal children. The early medical literature on many poli i E 
normal children such as hydrocephalics, microcephalics, anencepnallcs, 
idiots, and cretins is very extensive. Many of the reports contain some 


i i tion of these reports, 
Bene i t behavior. The orientation ' : 
we nee: ied than psychological, and it seems 


however, is on the whole medical rather I £ ns 

best to limit the present report to psychological studies of normal chil 

ren, ; i 

In organizing an historical survey, à Hus amps onmi 
rimary divisi ics | 

tate irn rice, em Sd within each division. In 


Made, the historical sequence being follo | Hn ed enit 
* present instance, a purely chronologica trea 


i herefore, seemed 
€ juxtaposition of entirely unrelated studies. It et ee miei es 
est to subdivide the studies by subject matter 


IStorica] tr : PF 
he ari ‘ere of the topics is more or i olen E uiae 

8raphical records are among the earliest, and are also pei tera 

It seems appropriate to place the infant biographies 

the Survey, 


h 


BIOGRAPHICAL STUDIES 


f his son 
i ioral development of 
Tiedemann’s account (47) of the eee) Tt appeared i AAA 


as : ; 
The es first infant biography to be = iabe rations were 1 781-1784 
ears covered by Tiedemann's sca 
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Three years of occasional observations were reported within the space 
of 25 printed pages. 

While Tiedemann was the first to publish a record of the develop- 
ment of an individual child, at least two other persons kept such à 
record at an earlier date than did Tiedemann. The two earlier diarists 
were Heroard and Pestalozzi. Heroard was a physician in the French 
court who was assigned to care for the health of the youns prince who 
became Louis XIII. The physician began his duties at the birth of the 
prince in 1601 and continued them for many years. Heroard kept à jour- 
nal in which he recorded the chief events in the court, his own activities: 
and also, in some detail the health and development of his young charge: 
Thus, while the developmental data concerning young Louis XIII are 
mixed with other matters, they are relatively complete. Heroard's 
journal, edited by Soulie and Barthelemy (21) was not published unti 
1868. It has furnished the basis of a recent book by Crump (7), entitle 
Nursery Life 300 Years Ago. Although Miss Crump cites some 0 the 
behavioral data, her chief interest lies in picturing the life of the court, 
rather than the development of the dauphin. 

Pestalozzi, in 1774, kept for a period a diary in which he recorded 
his attempts at the education of his four-year-old son and made some 
record of his son's behavior. None of this diary seems to have been pub- 
lished in Pestalozzi's lifetime (1746-1827). Parts of it were containe 
in Niederer's Notes on Pestalozzi (32) whose date of publication was 
1828. 

Several studies of individual children made in the first half of 
nineteenth century lay unpublished for many years after they we 
completed. Bronson Alcott kept for a while a record of the € 
ment of a daughter born in 1831. This record, still unpublishe , 
probably contained in notebooks left by Alcott (30). A brief summ? 
of this record was presented by Talbot (46) in 1882. un 

Charles Darwin's diary of the development of his infant son: pier 
in 1840, was not printed until 1877 (10). Strumpell's longitudinal T vat 

of his child (43) was kept during the years 1846 and 1847, but di 
appear until 1880. 


s , i ; py 9? 
The earliest baby biography to be published in English wa Aem 
ndix to 2” 


American, Mrs. Emma Willard. It appeared as an append: P suction 
ican edition of Madame Necker de Saussure’s Progress? 
Its date of publication is 1835 (50). jal 


" e 
Sigismund's book (41), which contains some biographical kc eyer Ê 
appeared in 1856. Four other individual studies antedate Taine 
Die Seele des Kindes by one or two years. These reports were 


(45), Wyma (51), Sully (44), and Champneys (5)- 


_— 
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T " CE 
gi a data just summarized indicate that at least twelve biographical 
m rds were kept before Preyer undertook his observations of his son 

5 " H y 
ey indicate also an increase in frequency of records of this type in 


the years just preceding Preyer's study. 
LANGUAGE STUDIES 


m. x ond d imeem pe s this heading might also be 
c \ à e all of the early researches on 
infant speech are longitudinal records of one or two subjects. Many of 
the baby biographies previously cited contain some account of language 
ye sae However, none of those mentioned in the preceding sec- 
bi was solely concerned with speech, whereas that topic is almost the 
E usive interest of those to be discussed here. Schleicher (39), who 
vrote in 1861, seems to have been the first to record the speech develop- 
ment of a specific child. His report was followed fairly closely by those 
of Holden (24), Pollock (33), Vierordt (49), Egger (12), Schultze (40), 
and Humphreys (25). Since each of these was primarily a brief presenta- 
tion of individual developmental data, further description seems un- 


necessary here. 

4 Hun (26) described in some detail the invention of new words by a 

oe girl who understood English quite well. This new language 
e taught to her 18-month-old brother. 


NORMATIVE STUDIES 


sh Several early investigations may be grouped 
are the common characteristic of being interes 


f . B H 
a hanee of some age level, or of being interested i 
wis age of children at the onset of some developmental item. The 
earliest of these normative studi as that of Feldman (13), published 


in 1g ies w nent 
7 1833, Feldman was a candidate egree. At that time, 


it for the M. D. d L 
, Was customary to require each candidate to prepare à short disserta- 
he normal function- 


tion ; I 
.9n in Latin, Feldman chose to write concerning the 
d under the heading of the normal 


Ing 

wate, the human body, and include r 
e € of the body in infants, data on the onset of walking and talking. 

state sumably obtained his data from mothers’ reports, although his 
Sana is not clear on this point. He states that 2 rod 

Svents i i ts, but it seems scarcely possible ha 

piu oor end observe personally 


at he n obe 
the could have had the facilities OF the time tc 1 
eginni : :on in so many subjects. 
Fel Sinning of speech and of upright ocomotion ! 


man? E 
an? writes: 


together because they 
ted in the typical per- 
n determining the 


e original monograph 


f this work. Thi D. 


2 
n I å 
Say „am indebted n lation ol 
ail, $ ed to my wife for the translatio! E 
Able in the Library of the Surgeon General's Office, Washington, 
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If you inquire at what time children attain the faculty of walking, that thing 
I have observed in the case of 35 infants. Of these, 6 were able to walk at the 
41th and 12th months; however, the rest required 13 months, and some even 
18 months or a small part of the 18th month in order that this feat might be 
accomplished without aid. All concerning whom I have spoken enjoyed the 
best of health. ... I have found among the 35 infants that one of them at- 
tained the faculty of speaking the first word in the 14th month, 8 in the 15th 
month, 19 in the 16th month, 3 in the 17th month, one in the 18th month, 
and one in the 19th month. Therefore, the normal time for first speech in 
infants would seem to be the sixteenth month. 


It will be seen that Feldman's presentation of his data in regard to 
walking is very incomplete, but it is clear that the majority of his sub- 
jects were one year of age or older before walking alone. His distribu- 
tion of cases for the onset of speaking is more detailed, but he presents 
data on only 33 of his 35 cases. Nevertheless his study is notable for 
preceding by many years the next attempt to set up empirical standards 
for the onset of locomotion and speech. 

The first person to apply a uniform behavioral test througho 
large part of the life span was Quetelet (37) whose work appeared in 
1835. Quetelet is known for his contributions to methodology in the 
field of census records, social statistics and vital statistics. However 
his interest extended beyond what is usually thought of as comprising 
these fields, and he presents in his Physique Sociale observations Ol 
strength of upright pull and strength of grip of the right and the left 
hands from ages 6 to 60. His data are presented separately for the two 
sexes. Yearly age groups are employed from 6 to 21 years; there is 4 
25-year-old group; thereafter his subjects are grouped by decades. n 
each group, his averages are based upon ten persons of each sex. 
data show that males are superior in strength to females at every age 
level. The data also show the superior strength of the right hand 2 
compared to the left in dextral individuals. i d 

In addition, Quetelet presented data on the incidence of suicide a” 
of crimes at various epochs of life, including the period below ten ai 
The data are presented by 5-year step-intervals from 10 to 40 oer 
and by decades thereafter. Especially noteworthy is the high sents 
rate of Berlin youth between 15 and 20 years of age- The data for 
refer to the years 1818-1824. 

The first normative study of the newborn was tha 
(28), which appeared in 1859. Kussmaul, li 
cian. His experiments with various modes o 

im to believe that all of the sense organs were capable © 
wae unction at birth. He gave fairly adequate description rowed 
s of the neonate. This pioneer study W 


ut a 


t of Kussmatl 


f stimulating the 


gree of f 
repertory of response 


Wer E 
. © employed were essential 
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b ae 
«beri e investigations by Genzmer (19) and Kroner (27), which 
s ut little to the findings of Kussmaul. 
Mee Sd studies of neonatal behavior may be called normative only by 
studies = os the meaning of norms, yet it is true that these 
Bat: e. esigned to investigate the characteristics of an age period. 
suction and Biedert (4) were interested in measuring the force of 
aieea i or negative pressure, of the neonate's nursing response and de- 
Rants pparatus which they used for that purpose. Donders (11), 
are 2 and Witkowski (38) and Cuignet (8) made observations 
he ocular responses of the neonate. Moldenhauer (31) tested the 


caring of the young infant. 


THE CONTENTS OF CHILDREN’S MINDS 
erved as models for Hall's work by 


The studies to be reviewed here s 
duction to his investigation gives 


th ; su 
E. same title, and Hall in his intro 
excellent account of his predecessors’ work. 


The first study, which was conducted in Berlin, 


tholomai (3). While the Berlin investigators described their problem 
tents of children's minds, the 


ur as that of determining the con ¢ e 
lu: or questions which they submitted to their subjects called for 
E. al types of performance. One procedure consisted in asking each 
Su Ject which of a number of common objects, such as a pond, a lake, 
Me hare, a squirrel in a tree, a hen with chickens, etc., the child 
th ever seen. In this portion of the study it was undetermined whether 
«i child who had not had these experiences might nevertheless have 
me comprehension of the phenomena gained from pictures or from ver- 
Sen i descr: iptions; the child was interrogated only with regard to his direct 
Perience with the objects. Un the other hand, some questions which 
ly tests of language comprehension. It was 

id knew what was 


etc. Other questions asked 
is father. The examining 
teachers to whom a circular or direc- 
earch procedures were far from ideal. 
s conducted in 1869 and whose re- 


2238 children just entering school were 
i y was the remark- 


was reported by Bar- 


etaim: 
en med for instance, wh 
nt by a dwelling, a sphere; 


sti p ! 
e tioned in regard to 75 topics: c 

ss Sag to which first graders are ign 
had seen 50% knew what was ™ 


Seen a sunrise, only 11% 
knowledge of s 


d attended ki 


Bitls 
to pp >celled in regard to 


ers. Children who ha! 
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better than those who had not attended, a finding which foreshadows 
an interest of the present day. 

Lange (29), in 1879, presented similar questions to 500 children 
entering the city schools of Plauen, Germany, and to 300 children in 
near-by rural districts. In regard to the concepts which he attempted 
to test, many of which were of the nature study sort, the country 
children proved to have a superior knowledge. 

G. Stanley Hall's study of Boston children (20) was begun in Sep- 
tember, 1880. It was first published in the Princeton Review for 1883, 
and was later republished several times. Hall drew up a list of 134 
topics about which his subjects were to be interrogated. It is not within 
the province of this article to describe Hall's study in full. We may note 
in passing that Hall’s contribution in this field was chiefly that of im- 
provement in method. Instead of utilizing ordinary classroom teachers 
whose methods of examination could not be controlled, Hall employed 
four kindergarten teachers who were trained to uniform procedures o 
questioning, and who frequently met with their director for discussion. 
In Boston, as in Berlin, the percentage of ignorance concerning simple 
concepts was surprisingly high. 


GALTON’S "INQUIRIES INTO THE HuMAN FACULTY AND Its 
DEVELOPMENT" 


That Galton's work is not usually given a prominent place in the 


development of child psychology is probably due to the fact that this 
field was not his primary interest. Itis true, nevertheless, that he shou 
be credited with several contributions to this area of specialization 
Although several of his writings bearing upon childhood were include 
in the Inquiries Into the Human Faculty and Its Development which was 
published in 1882, their first date of publication in most instances WaS 
earlier than that year, since Galton’s book consisted for the most part 
of reprints of articles that had previously been presented in scientific 
and popular journals. 

In 1875 Galton (14) published the first scientific study of the psycho 
logical development of twins, thus initiating one of the major types ° 
investigations of nature and nurture. Galton’s method consisted o 
inquiring concerning twins who in childhood were very much alike t 
determine whether or not they became more unlike after they le 
their homes and entered different environments. Conversely, he ene 
information concerning twins who in the earliest years Were exceedingly 
different, to learn whether or not under the continued influe 


same nurture they became more similar. His materials were 


eM 
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by sending "circulars of inquiry" to persons who were either twins 
themselves or were relatives of twins. 
. Galton found in regard to these twins who were remarkably alike 
in childhood that in some cases ‘the resemblance of body and mind con- 
tinued unaltered up to old age, notwithstanding very different condi- 
tions of life." Other cases showed divergence with age. He felt that the 
continued close similarity of some cases despite different environments 
showed that the divergences of other twins were due to illnesses or to 
different original endowments which only became apparent in maturity. 
The second group of data, derived from twins who were originally dis- 
similar, likewise led Galton to stress the importance of nature, since he 
found that no matter how similar the environment of the two, they re- 
mained markedly different. Galton wrote: “There is no escape from 
the conclusion that nature prevails enormously over nurture when the 
differences of nurture do not exceed what is commonly to be found 
among persons of the same rank of society and in the same country." 
In Galton's “Psychometric experiments," first presented in 1879, 
(15), he employed himself as subject in studying the association of ideas. 
It may seem odd, therefore, to cite this research as a contribution to 
child psychology. One of Galton’s tasks, however, was to attempt to 
determine when each association had been formed. In many cases this 
was not difficult, since the date of formation of the association could 
be established with certainty. Galton found that of 124 associations, 
39% of them were formed in boyhood and youth, 46% in adult years, 
and only 15% could be attributed to recent events. On the basis of 
these results Galton called attention to the importance of childhood in 
the formation of associations between ideas. 
“Statistics of mental imagery” was the title o 
in 1880 (16). In pursuing his interest in imagery, 
known, devised rating scales for several aspects of imagery. Visual 
imagery was rated by 100 English men of science and by 172 Charter- 
house school boys. Vivid imagery was found to be much more common 
among the boys, and boys were more able also to call up colored images. 
Galton believed that the habit of abstract thought resulted in the sup- 


pression of visual images. 
The study of imagery W 
tion of number forms (17). 
located each number within a c 
of his informants were certain t 
numbers had been present, unchange 
finding led Galton to ask teachers to ques 


f a report by Galton 
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this matter. Among subjects of about 14 and 15 years of age, Galton 
found that clear-cut number forms appeared in about one case in 20 
in boys and more often in girls. 

On the basis of the number of contributions to the study of mental 
development which are contained in Inquiries Into the Human Faculty 
and Its Development, it is our belief that this publication should be 
grouped with those of Preyer and Hall as one of the important early 
publications in child psychology. 


MISCELLANEOUS STUDIES 


In addition to the studies previously discussed, which were capable 
of grouping, a few remain, each of which stands alone within the epoch 
under consideration. We shall mention these briefly, following the chron- 


ological order. 


Miss Elizabeth Peabody (1) in 1835, 1836 and 1837 published accounts of 
the procedures of Bronson Alcott in his remarkable Boston school. Alcott 
stimulated student particiption in the educational situation by holding ‘‘con- 
versations” with children. Miss Peabody recorded, as nearly verbatim as pos- 
sible, many of these conversations. The result was a detailed account of child 
reasoning and child response in Alcott's school. 

Darwin's "Expression of the Emotions in Animals and Man" (9) contained 
some observational material on emotional expression in children. Many o 
these observations were made on his own children; we have seen that his inter- 
est in child behavior began as early as 1840. Some observations were gathered 
from friends and correspondents. . 

Clarke (6) described certain images which commonly appear in childhood 
but which in most cases disappear with age. It seems likely that he was referring 
to what are now known as eidetic images. 

Sikorsky (42) was the first to study decrements of performance during the 
school day. Investigations of this type later came to be known as studies © 
mental fatigue. 

Vierordt (48) devised a shoe which would record t 
with the walking surface, and pointed out differences 
adult walking patterns. 

Hicks (23) devised a method of recording prenatal movements b 
displacements of the abdominal wall. 


he contact of the feet 
between infantile and 


y measuring 
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IN THIS FIELD 
ssional affiliations of those who 
avior. The occupations 


PROFESSION 


It may be of interest to note the profe 
first contributed to our knowledge of child beh 
of some of the persons to whom we have referred could not be ape 
tained from the sources available to us. However, it is obvious t E 
physicians were the most frequent contributors to the v jene b 
scious science. Among the members of the medical profession men 
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Clark, Cuignet, Feldman, Genzmer, 


in our bibliography are Biedert, 
Moldenhauer, and Wyma. 


PI, Hicks, Hun, Kroner, Kussmaul, 
Hii ^ ang cg next in frequency. This professional group includes Al- 
oe ls eabody, Bartholomài, Lange, Pestalozzi and Strumpell. 
things a as a physiologist; Darwin a biologist; Quetelet was many 
ie so was primarily a statistician; Taine and Sully both were 
[en pher-psychologists. Galton contributed to so many fields that 

can scarcely be claimed by any. Preyer was a physiologist. Hall 


belongs to both psychology and education. 


Discussion AND SUMMARY 
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child behavior. In 1828, Niederer publish 
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of a father. Thereafter, there was a consistent increase in publication. 
Between 1830 and 1860 there were seven publications. The 1860's saw 
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this matter. Among subjects of about 14 and 15 years of age, Galton 
found that clear-cut number forms appeared in about one case in 20 
in boys and more often in girls. 

On the basis of the number of contributions to the study of mental 
development which are contained in Inquiries Into the ITuman Faculty 
and lis Development, it is our belief that this publication should be 
grouped with those of Preyer and Hall as one of the important early 
publications in child psychology. 


MISCELLANEOUS STUDIES 


In addition to the studies previously discussed, which were capable 
of grouping, a few remain, each of which stands alone within the epoch 
under consideration. We shall mention these briefly, following the chron- 
ological order. 


Miss Elizabeth Peabody (1) in 1835, 1836 and 1837 published accounts of 
the procedures of Bronson Alcott in his remarkable Boston school. Alcott 
stimulated student particiption in the educational situation by holding “con- 
versations" with children. Miss Peabody recorded, as nearly verbatim as pos- 
sible, many of these conversations. The result was a detailed account of chil 
reasoning and child response in Alcott's school. 

Darwin's “Expression of the Emotions in Animals and Man" (9) contained 
some observational material on emotional expression in children. Many O 
these observations were made on his own children; we have seen that his inter- 
est in child behavior began as early as 1840. Some observations were gathere 
from friends and correspondents. . d 

Clarke (6) described certain images which commonly appear in childhoo 
but which in most cases disappear with age. It seems likely that he was referring 
to what are now known as eidetic images. A he 

Sikorsky (42) was the first to study decrements of performance during t 
school day. Investigations of this type later came to be known as studies © 
mental fatigue. t 

Vierordt (48) devised a shoe which would record the contact of the = 
with the walking surface, and pointed out differences between infantile an 
adult walking patterns. 

Hicks (23) devised a method of recording prenatal movements b 
displacements of the abdominal wall. 


y measuring 


PROFESSIONAL AFFILIATIONS OF EARLY RESEARCH WORKERS 
IN THIS FIELD 


It may be of interest to note the professional affiliations of dust 
first contributed to our knowledge of child behavior. The occupa 28 
of some of the persons to whom we have referred could not be ar » 
tained from the sources available to us. However, it is uico Per 
physicians were the most frequent contributors to the a iic cuim 
scious science. Among the members of the medical profession m 
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dba m are Biedert, Clark, Cuignet, Feldman, Genzmer, 
crm icks, Hun, Kroner, Kussmaul, Moldenhauer, and Wyma. 
ar "nari in frequency. This professional group includes Al- 
Vieraita eabody, Bartholomäi, Lange, Pestalozzi and Strumpell. 
tines, t age a physiologist; Darwin a biologist; Quetelet was many 
ie » was primarily a statistician; Taine and Sully both were 
he c p er-psychologists. Galton contributed to so many fields that 

an scarcely be claimed by any. Preyer was a physiologist. Hall 


belongs to both psychology and education. 


DiscussioN AND SUMMARY 
two publications which appeared before 
liest publication referred to in the 
1 record of Tiedemann, which ap- 
to be credited with a very real 
's behavioral development was 


ene have reviewed forty- 
. It will be noted that the ear 
Preceding survey is the biographica 
Peared in 1787. Tiedemann deserves 
scientific priority. His record of his son 
thoroughly scientific in spirit. Although Héroard’s record precedes 
Tiedemann's, Héroard's interest was that of a diarist and historian; 
his account of the behavior of young Louis XIII was not oriented to- 
Ward the study of human development. 
d Following Tiedemann, there seems to be a gap of forty-one years 
uring which we have no record of any publication relating to normal 
Child behavior. In 1828, Niederer published part of Pestalozzi's notes 
of a father. Thereafter, there was à consistent increase in publication. 
Between 1830 and 1860 there were seven publications. The 1860's saw 
the appearance of four titles. A sharp increase in publication occurred 
during the 1870's, the total number during that decade being ee 
uring the years 1880 and 1881 alone, 12 titles are listed in our 5! iog- 


raphy. 

i The decade by decade comparison of publica 
pe in activity toward the development of mo bli 
ad been under way for some time before the pu ic 
3g Hall. These two scientists were apparently respon 
nfluences of the times as were lesser men, but since dia 
A abler workers their contributions were more pero that 
n this survey we have seen no cause to que “1409's Mind. 
Preyer's The Mind of the Child and Hall's ere d bea dmt à 

We: . ing betor z 
re the most notable contri une man Faculty and 


butions appear" the Hu 
He gested, however, that Galton's Inquiry ee E the afore-mentioned 
Development should be rated as almost ed butions to chil 
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BOOK REVIEWS 


WIENER, NORBERT. Cybernetics (or control and communication in the 
animal and the machine). New York: John Wiley, 1948. Pp. ii--194. 


This curious and provocative book is actually a series of essays on 
a variety of topics rather than an integrated work. It treats of such 
diversities as servo mechanisms, statistical mechanics, time, digital and 
analogue computers, physiology of the nervous system, perception, 
psychopathology, prosthesis, and the nature of society. On each of 
these seemingly unrelated subjects Dr. Wiener has something challeng- 
ing to say. These concepts and subject matters represent to the author 
sub-topics in the new discipline of cybernetics. This term, which is 
formed from the Greek word meaning steersman, is defined as embracing 
the entire field of control and communication theory as applied to both 
animals and machines. 

The author is professor of mathematics at the Massachusetts Insti- 
tute of Technology. It is not surprising, therefore, to find that approx! 
mately one third of the book treats the sophisticated mathematics for 
which he is famous. In this part of the work, which will probably be 
fully understood only by professional mathematicians, the author C€- 
velops the communication theory which he proposes as a basic theoreti- 
cal framework for treating the activity of complex machines and living 
organisms. Beginning this portion with a brief history of the statistical 
mechanics of Gibbs and Lebesgue, Dr. Wiener discusses the develop- 
ment of ergodic theory under Koopman and von Neumann, then speaks 
briefly of entropy and the Maxwell demon and finally passes on to thé 
development of a statistical mechanics of time series. It is the latter 
which the author regards as most appropriate in discussing animal be- 
havior. : 

Fortunately for the majority of readers, mathematical notation 18 
held to a minimum in the remainder of the book. Sometimes with great 
simplicity, but always brilliantly, the author hurries from discussions 
of the concept of time, through rather technical yet fascinating discus- 
sions of servo mechanisms and electronic computers, through essays on 
psychopathology and human perception to pessimistic prognostication® 
concerning the fate of man and his society. One is a little breathlese 
when he arrives at the end of the last chapter where, incidenta! Y> 
discovers a final note on how to construct a chess playing machine o 

There are many challenging speculations in this work which Ys 
provoke discussions and, it is to be hoped, research. But the aut xis 
most important assertion for psychology is his suggestion, m be 
peated, that computers, servos, and other machines may pro tab Term 
used as models of human and animal behavior. Although it is nOW 
stated explicitly, the implication is strong that the value to the Pes 
ologist and psychologist of these physical models lies less within 
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selves than in the ready-made mathematical theory which is now "ms 
ciated with them. It seems that Dr. Wiener is suggesting that psycholo- 
gists should borrow the theory and mathematics worked out for ma- 
chines and apply them to the behavior of men. 

Action upon such a proposal is, of course, contingent upon evidence 
of the degree of similarity between the analogue and that to which it is 
supposed to be analogous. A close correspondence between the two 
would suggest that there might be some profit in applying the mathe- 
matics of the one to the other, though no degree of correspondence short 
of perfect identity could relieve the researcher of his scientific caution. 

Obviously, since men can not as yet fabricate themselves out of 
vacuum tubes and circuits there is less than perfect identity between 
man and the models proposed by Dr. Wiener. Butis the parallel suffi- 


ciently close to be encouraging? 
Little evidence on this point is cited in the book other than examples 


of gross similarity between the action of men and machines. Indication 
is given of a parallel between voluntary activity and the response of a 
closed loop servo which responds, not to an absolute signal, but to the 
difference between a desired state of affairs and the state of affairs 
which exists at the moment. Certain forms of ataxia are recognized as 
suggesting certain conditions of servo instability, while others are said 
to resemble the response of a servo with an open feedback loop. The 
activity of the brain is compared to the working of an electronic com- 
puter and the suggestion is made that certain forms of human insanity 
have their parallel in computer breakdowns. 

titute almost the entire body 


These and other like observations cons 
of "evidence" that machines closely resemble men in their activities. 


Almost no factual data other than of a purely physiological nature 
exist to lend credence to this important postulate. 

In fairness to the author it must be pointed out that no attempt is 
made to "prove" the fruitfulness of t : 
nor to provide an exhaustive catalogue of man-machine correspond- 
ences. Dr. Wiener is content to develop the mathematical theory of 
machines and then to point out certain similarities between the re- 
sponses of these devices and those of living beings. He may regard it 
as the task of others to fill in the details of the pattern which he has 
sketched in outline. . 

Yet one cannot help but regret that more of the details were not at 
hand when the book was written. In the final analysis it is precisely 
these particulars which will determine whether the book should be 
judged as literature or science. At present one can only hope that 
cybernetics will prove to be as fruitful as it now stimulating to the 


imagination. 


he proposed analogous reasoning 
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TES author's ideas here set forth, in the light of her own clinical experience, 
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founder of individual psychology. She herself has specialized in neurology, psy- 
chiatry and psychotherapy, both in America and abroad. It should be helpful to 
physicians, social workers and teachers, as well as to educated parents and S 
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A HISTORICAL SURVEY OF PSYCHOLOGY IN 
THE SOVIET UNION! 


IVAN D. LONDON 
Tulane University 


To trace the historical development of psychology in the Soviet 
Union is to acquire the means for an understanding of its present status 
and content, about which so little is known outside the borders of the 
Soviet Union. In this paper it is proposed to survey the historical 
antecedents of contemporary dialectic psychology in the Soviet Union 
and to provide some idea of its basic tenets as presently constituted. 
While it were best to give a historical exposition which could take us 
down to the present, the purposes of this paper, as it so happens, do not 
necessitate a detailed account of events in the Soviet psychological 
world beyond 1936. The writer hopes, however, to supplement the 
present paper in the future with a historical account of developments 
subsequent to 1936 and so help overcome the unfortunate meagerness 
in knowledge of the workings and level of accomplishment of dialectic 
psychology in the Soviet Union. 

When one surveys from the vantage point of the present the course . 

} that psychology in the Soviet Union has taken since the days of the 
October Revolution, it is possible to mark out three periods in its 
evolution. ‘These periods in the evolution which eventuated in the 
~~ establishment of a dialectical psychology, distinct from all other schools 

of psychology, may be designated roughly as (1) the pre-Marxian 
period (1917 to 1923 inclusive), (2) the period of trial and “error”? 
1 The writing of this paper was initiated in p course p dip inet ined on Soviet 
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(1924 to 4930-1931), and (3) the period of the gradual emergence of a 
genuine dialectical psychology, a Soviet psychology, based on dialectical 
materialism and developed throughout in conscious accordance with its 
principles (1931 to the present). 

This conscious attempt to found a psychology whose methodology 
is that of dialectical materialism (101, 111, 114), as well as the need to 
cope with the specific tasks placed before it by the urgent demands of 
revolutionary reconstruction from Tsarism to socialism, determined in 
large measure the particular development that dialectic psychology 
came to exhibit (137, p. 21). Teplov, writing as a Soviet psychologist, 
therefore states that ‘‘the history of Soviet psychology is really the 
history of the Soviet psychologist’s mastery of the Marxist-Leninist 
methodology . . . [and that] each new step forward of Soviet scientific 
psychology, each of its large-scale achievements, is witness to a new 
stage in his creative mastery of Marxism” (117, p. 9). 

It will be seen that the development of dialectic psychology in the 
Soviet Union was by no means a smooth and uninterrupted affair. It 
had its confused and muddled periods, as well as periods of clear direc- 
tion and steady progress; times also when in retrospect it scemed to be 
diverted far from its set aims and goals. But the eventuating result was 
a psychology, firmly set on dialectical materialist principles and on 
these bases, to all appearances, effectively functioning and forging 
ahead. 


Tue Pre-MARXIAN PERIOD 
General Observations 


The first period, dating from the revolution to 1923 inclusive, is 00° 
during which psychology in the Soviet Union may be seen as beset by 
the same well-known difficulties plaguing psychology in Wester? 
Europe and America. Attempted solutions of these difficulties tende 
to follow the same lines as in western Europe and America, an for & 
number of years, up until 1930-1931, Gestaltist orientations, Freudian- 
ism, Stern’s personalistic psychology, Adler's individual psycholog, 
and especially American behaviorism were from time to time more 
Jess in evidence (28, 127, 136). 

However, to overcome the crisis in psychology as d 
under the influence of the continuing revolution, Sovie 
during this period came to champion the need of a thorout 
construction of the philosophical foundations of psychology ! 
of a genuine Marxism. Although thig reconstruction W gan E 
o have fallen far short of actual accomplishment, 
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at that time, nevertheless, was that so-called idealistic psychology soon 
ceased to exert any real influence. In fact, its extreme speculative and 
metaphysical wings, represented by such figures as Losskii and Frank, 
disappeared altogether, and conceptions, which in Europe are asso- 
ciated with Husserl and the phenomenologists and in the Soviet Union 
with the names of Lopatin, Chelpanov, and others (7, Ch. 8; 137), 
were surrendered in favor of the doctrine of “scientific materialism” 


(65, 93). 

Instead, developments patterned after American behaviorism came 
strongly to the fore, especially in the guise of Bekhterev’s reflexology. 
At the same time, also, Pavlov (79), the successor to the work of the 
physiologist Sechenov (12, 39, 49), was given, through the personal 
intercession of Lenin, every opportunity to proceed with his task of 
laying the foundations for a neurologically based psychology despite 


the disruptions occasioned by the Civil War? (80). 


The Contributions of Blonskit and Kornilov 


In 1921 Blonskif in his book, Essay on Scientific Psychology (21), 
entered the revolutionary lists against the traditional psychologies, 
which he claimed reeked of idealism, and issued the first summons in 
behalf of a psychology reared on materialist bases. Although he was 
later criticized for his adherence to American behaviorism, his gross 
mechanism, and lack of dialectical materialist mastery, his attacks on 
"university psychology” were considered telling. Highly regarded also 
was his thesis of the social and class nature of psychological man. "Ina 
class society 'man in general'," wrote Blonskii, “is an empty abstrac- 
tion; for man’s social behavior is determined by the behavior of his 


class” (117, pp- 10-11). 

In 1922 Kornilov, in his book, The Theory of Human Reactions? (42), 
s which led to the disappearance of pre-revolutionary Russian 
gence of “materialist” schools was to be expected, given the 


3 The course of event 
an revolution and its planned and unplanned im- 


psychology and the emer, 
thoroughgoing character of the Russi 
pact affecting the whole of Russian society. S 
4 Blonskiy was a pupil of Chelpanov (28a), who was, along with Nechaevy, N. N. 
Lange, and Bekhterev, one of the foremost representatives of experimental psychology 
in pre-revolutionary Russia (7, Ch. 6). Of this group Bekhterev is especially to be 
noted for his program of “objective psychology” which he promulgated in a book (16) 
first published espousal of sim- 
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joined the fray against idealism and philosophy in psychology. Al- 
though a pupil of Chelpanov, he insisted on the complete separation © 
psychology from philosophy, both as regards method and object of 
study, so that it be included in the ranks of the natural sciences. Kor- 
nilov, in addition, championed an end to the academic cloistering © 
experimental psychology, demanding that it not shun to deal with psy- 
chological questions pertinent to life and the demands of “socialist con- 
struction.” 

In 1923, Kornilov issued a veritable call to arms in behalf of a 
Marxist psychology frankly based on dialectical materialism when, be- 
fore the First All-Russian Psychoneurological Congress held in Moscow: 
he presented his paper, "Contemporary Psychology and Marxism’ 
(43). This paper signaled the first open struggle between the new 
“materialist” psychology and the old idealist psychology, dubbed 
Chelpanovian. It led to the latter’s swift demise, thus bringing to a 
close the first stage in the development of a Marxist psychology. 


THE TRIAL AND ERROR PERIOD 


It was during the second period, dating from 1924 to 1930-1931, 
that Marxism may be said to have won for itself recognition as the 
proper methodological basis of psychology in the Soviet Union. But, 
as we shall see, this recognition was at best theoretical, since, by the 
end of the decade, its implementation in actual acceptable psycho- 
logical theory was judged not to have been satisfactorily accomplishec- 
However, several psychological schools did spring up, each claiming to 
be the true bearer of Marxism in psychology, each subsequently to be 
denied its claim. 


Bekhterev's School and Contemporary Criticism 


During the first decade of this century Bekhterev initiated the 
laboratory study of "associated reflexes,” using dogs and then human 
beings as subjects for special training. His method of "associate e 
flexes" is distinguished from that of “conditional reflexes” in that is 
made use of electrical skin irritation as the “chief stimulus" Gee 
“unconditioned stimulus”). This stimulus, when applied, provokes ir 
defensive movement. When combined with some originally indiffere » 
stimulus, an "associated reflex” (Pavlov's "conditional reflex [fs 
developed in that the previously neutral stimulus can itself now € 
the same defensive movement (19). wt of "re 
For the many who insisted that the chief and direct objec an 


flexological” study was the human personality, itself, there were 
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advantages seen associated with the selection of the motor sphere in- 
stead of the secretory, as in the Pavlovian school. Shnirman (106), for 
example, listed the following: the facilitation of transfer of experimental 
results when human beings are involved; the inclusion of voluntary acts 
for study purposes; the possibility for an accounting of the effect of 
antagonists, that is to say, an accounting of not only the positive, but 
also the negative values of the reflex. In their enthusiasm the partisans 
of Bekhterev's approach did not hesitate to claim for it priority and [ 
superiority to that of Pavlov's. 

Pavlov consistently and intentionally operated within the frame- 
work of physiology in his classic investigations on the conditioned re- 
flex. Bekhterev, however, went to such lengths as to propose an inde- 
pendent discipline, reflexology, distinct from the physiological study of 
reflexes and from psychology which it was to replace. Bekhterev af- 
firmed the inaccessibility of the psyche to knowledge and, stressing 
instead "outer behavior," reduced conscious behavior to combinations 
of reflexes, essentially no different from those of animals. This stand 
put Bekhterev in the forefront of the movement in the direction of what 
afterwards came to be termed “vulgar mechanism" (93). 

Maintaining that the whole of psychology was rank idealism, inas- 
much as psychology made the psyche the subject-matter of its study, 
reflexology from about 1920 to 1926 acquired tremendous popularity 
and spread to the point of actually pushing psychology out of school 
curricula. In the words of Rubinshtein, "vulgar mechanistic material- 
relying on the support of the reigning *methodologists' of those 
years, triumphed, celebrating the feast of victory" (91, p. 68). 

Thus, according to Bekhterev's critics (81), in reflexology all be- 
havior was reduced mechanistically to elementary reactions of the or- 
ganism (in other words, to reflexes) so that by their proper combina- 
tions all properties and special characteristics of human activity, both 


psychic and social, could presumably be accounted for. But, in addi- 
tion to this, in the view of subsequent criticism, Bekhterev, in formu- 
lating the philosophical bases of reflexology, unwittingly committeed 
himself to an idealism as "vulgar" as his mechanism in that he reduced 
literally everything to "energy." . 
In the hostile eyes of his critics, 
reduction of matter and the psyche to energ 
two unfortunate consequences: (1) the mec 
sides general reflexology, there were established subdivisions 
ollective reflexology. However, Bekh- 
“bioconscious” discipline (19, 
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physical laws over not only to psychic but also to social phenomena; (2 


the energizing of the psyche in such a way that the psyche, which 
initially was denied in its specificity for man, came to be viewed as 
spread in the form of energy over the whole world—a notion that was 
denounced as partaking of the nature of an extreme panpsychism (91, 
p. 68). Yet, despite the extent and violence of this criticism, Shnirman 
was still able to claim in behalf of Bekhterev, one year before the 
"storm" which relegated reflexology into the limbo of discredited 
theories broke in 1930-1931, that Bekhterev “stands firmly on the basis 
of dialectical materialism” (105, p. 237). 


Kornilov's School and Contemporary Criticism 


The attack which Kornilov led on Chelpanovian psychology before 
the First All-Russian Psychoneurological Congress swiftly became the 
occasion of a three-cornered argument when the advocates of Bekh- 
terev's reflexology got to propounding zealously their views and assault- 
ing bitterly those of their opponents— Kornilov and Chelpanov. Korni- 
lov's adherents regarded Bekhterev's reflexology as "vulgar mechan- 
ism" fast degenerating into direct idealism; whereas the partisans of 
Bekhterev viewed the position of Kornilov's following as metaphysi- 
cally tainted in that the latter was stillinterested in “psychology,” which 
to the proponents of reflexology was prima facie evidence of the essen- 
tially idealistic character of the thinking of Kornilov and his group 
(93). 
The representatives of reflexology were most active in the propaga- 
tion of their views and, at the Second All-Russian Psychoneurologica 
Congress held in 1924 in Leningrad, they scored their greatest public 
success (117). However, despite the reflexological craze, toward the 
close of 1923, when a radical reorganization of the Moscow Psycho- 
logical Institute was pushed through, it was Kornilov who became the 
director of its new basic staff of young “progressively” oriented re- 


searchers. 
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vulgar materialism); the other is that of dialectical materialism. Naive mate- 
rialism frequently reverts to idealism, and each denial of psychical states as 
such or their identification with physiological processes is in direct contradiction 


to dialectical materialism. 
3. Furthermore, the essence of man is the aggregate of his social relation- 


ships, and the psychology of the individual is impossible that does not deal 


with the psychology of his class. 
4. Also to be kept in mind are the basic principles of dialectics: (a) continu- 


ous change, (b) the universal interconnection of phenomena, and (c) the 
“leaping” or saltatory emergent character of development. 


Kornilov's basic points, however correct they may have been in 
verbal enunciation, were subsequently considered as inadequate to the 
task of setting up a truly Marxist psychology and insufficiently under- 
stood by Kornilov himself; ostensibly as evidenced by his further elab- 
oration of them and the use to which they were put (11, 45a, 93). 
Nevertheless, they brought about the complete eviction of Chelpanovian 


psychology from the scene and, from 1925 on, the steady decline in en- 


thusiasm for reflexology. 

As his version of a 
actology"' (45, pp. 268-278), that is, 
which he responds to outer stimulati 
research was to be the study of the spee 
ments in his response to earlier conditioned signals. However, it soon 
became evident that between the grandiose aims of the contemplated 
Marxian psychology and the scanty program of its concrete content 
there was fundamental contradiction. The latter could not possibly 


lead to the former. 

Therefore, according to his many critics, Kornilov never did really 
manage more than a programmatic declaration of principles. They felt 
that “‘reactology,” the system of psychology which Kornilov devised 
to meet the demands of the thoroughgoing Marxism that he contem- 
plated, was also, in the last analysis, after all only an "eclectic mechan- 
istic formulation based on psychophysical parallelism” (37, 93), and 
thus could hardly be said to have fulfilled the program he had set for 

f fact, the Great Soviet 


himself and psychology in general. As a matter o 
Encyclopedia goes so far as to accuse Kornilov of “not having [at that 


time sufficiently] mastered the Marxist-Leninist methodology," so that, 
in its opinion, it was not surprising that his “reactological psychology, 
under the strong influence of Wundtian doctrine, was itself a victim of 


the coarsest mechanism interwoven with idealistic error” (45a, P- 323): 
, 1925 methodological articles of Kornilov one 
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of existence" and “perception as the subjective reflection of existence, 
whose reality is independent of consciousness,” these gave way in 1926, 
when his Textbook of Psychology (44) appeared, to a formulation which 
was criticized as deeply in error and anti-Leninist; namely, ‘‘the psyche 
and consciousness are the subjective reflection of physiological pro- 
cesses." In the last analysis, then, in reactology the psyche and con- 
sciousness, being mere duplications of physiological processes, could 
not be the true subject-matter of psychology, but only behavior. 

Many were dissatisfied with the banishment of consciousness from scientific 
psychology. There arose, therefore, a "struggle in behalf of consciousness.” 
‘As far back as 1925, when the first collection of research conducted at the Mos- 
cow Psychological Institute was published, Vygotskit protested that, "in 
ignoring the problem of consciousness, psychology, itself, cuts off access to the 
investigation of those problems of the psychology of behavior that are to any 
degree complex" (117, p. 15; 129). The problem of consciousness, however, ha 
to wait until the next decade before its acceptable resolution. Reflexology 
would have none of it; reactology could not cope with it. 


The reactological views of Kornilov were undermined not only by 
critical Marxian analyses, as already noted, but also in part by the ex- 
perimental investigation, begun by Vygotskit and continued by his 

co-worker, Solov'évyt, in which it was shown that a "dominant tenc- 
ency” of behavior, expressed in the form of a ‘‘conscious aim," can 
subordinate to itself unconditioned reflexes “infinitely surpassing it in 
strength.” Thus, if a person is required not simply to perform certain 
actions, but to perform them in spite of interference on the part of 
stimuli irrelevant to the task, then these stimuli are psychologica y 
no longer distractions, but obstacles to be overcome, and the rate 9 
basic work under such circumstances may exceed that under quiet con- 
ditions (117, p. 14). i 
This revealed the possibility of breaking past the limitations um- 
posed by the reactological views of man as a merely reactive being, that 
is to say, a being passively respondent to stimuli. Even reactologists 
were not long in beginning to emphasize the directed activeness of man. 
It should be noted that throughout Kornilov's reactology the sharP 
and militant mechanism of Bekhterev's behaviorism became more xd 
promising and eclectic in character. This was because reactology aye 
tially attempted a synthesis of the two extremes—behaviorism , les 
introspectionism, professedly in strict accordance with the princip 
of dialectical materialism (45, 93). ard 
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In this he was affirming the principle of the unity, but not the identity 
of the physical and the psychical (106). H 

Kornilov came then to believe in the possibility of establishing a 
demonstrable causal coherency between physical phenomena; but bes 
tween physical and psychical, or between psychical and psychical, he 
averred its impossibility. Thus, he saw the "objective" method as ‘ths 
only scientific method. The method of introspection,” though by no 
means ruled out, could offer only subjective description, which without 
concomitant control through objective data, was conceived as having 
nothing to offer of scientific value. In other words, psychology was, 
after all, the science of behavior; but behavior, itself, in the last anal- 


ysis must be regarded as a combination of reactions responsive to ex- 


ternal stimuli. 
Kornilov viewed these reactions as energy processes and strove to 


relate them to inner psychic phenomena. However, his critics regarded 
smacking too much of idealistic psychophysical parallel- 
conception. This theoretical perform- 
ance of Kornilov's, therefore, could not be successfully carried off. 
Kornilov, despite good intentions, was regarded as having failed, along 
with Bekhterev, to establish a “truly dialectical psychology—a psy- 
chology adequate to the demands of "socialist construction"—and 
"after sharp discussion in 1931 he acknowledged his former errors” 


(45a, p. 323). 


his efforts as 
ism and as too mechanistic in 


The Influence of Pavlov and Ukhtomskit 


In 1923 Soviet psychologists were confronted with the challenging 
task of incorporating and utilizing the brilliant discoveries of Pavlov, 
presented in his book, Twenty Years of Objective Experimental Study of 
the Higher Nervous Activity of Animals (82). This was ably performed in 
Kravkov's Essay on Psychology (51) and Dobrynin’s Introduction to 
Psychology (29). In addition, Vygotskil, basing himself on the Pav- 
lovian theory of the conditioned reflex, developed during the years of 
1925-1926 the ideas of Sechenov? on the reflexive character of the psyche 
(7, Ch. 5; 39). R P : 
Although Pavlov's theory was viewed as providing at last a physio- 


the materialist cannot acknowledge non-material phe- 
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al explanation for the various laws of association of the 19th 


logic 
a to this one 


century, the explanatory reduction of all psychic phenomen 
theory has been since frowned upon and regarded as incorrect in prin- 
Furthermore, the Pavlovian theory has been affirmed as inade- 
or even an understanding of the physiological bases of human 
let alone its psychic aspects (1; 30; 34; 36; 80; 91, pP- 462, 


ciple. 
quate f 
behavior, 
500). 
However, Pavlov's immense prestige and authority induced too 
credulous and uncritical an attitude toward physiological work in gen- 
The result of this hyper-enthusiasm for anything physiological 
attempt, on the 
a 


eral. 
was a rash of rather impetuous generalizations and an 


part of physiologists, to trespass onto other fields and, conversely; 
simultaneous eagerness, on the part of the workers in these invaded 
fields, to "physiologize" or "'biologize" their own subject-matter of their 
own accord. The very existence of psychology as an independent dis- 
cipline was threatened and, as already noted, reflexology, in particular, 
did actually manage a partial displacement of psychology (28, 93). 
Mention should be made of Ukhtomskii (122a) and his ‘‘theory of 
the dominant” (50, pp. 33-67; 122) which exercised a great influence on 
Soviet psychological thought. From 1923 on, this Soviet physiologist 
developed in a series of papers and articles the idea of the “dominant” 
as a seat of temporarily heightened neural excitation, deflecting to it- 
self stimuli from other possible seats and thus dominating and deter- 
mining the joint action of neural centers. Ukhtomskii was led to this 
theory by such observations as the following: (1) A cat, surprised in 
the act of micturition by a dog, seems rooted to the spot, para « 
by fear but by its momentary activity; (2) electrical stimulation of the 
motor cortex of an animal with a full rectum will not lead invariably to 
movement of the corresponding extremities but frequently to defeca- 
tion. Only when the rectum has been emptied will clectrical stimulation 
of the motor cortex always evoke the customary motor responses. , 
many the theory of the dominant seemed to bring nearer to realization 
a physiological explantion of goal-directed behavior and, according : 
Dobrynin, that of attention. In any event, the theory was effective ! 


À A : : rk 
as much as 1t at the very least, stimulated much experimental works 


(36, 125). 

Vygotskiv s School and Contemporary Criticism 
928 the problem of development attracte 
as the subject of special attack by a £ 
yng them Luriià and Leont'ev, headed by VU his- 
joint efforts there emerged their “theory of cultura 


Beginning in 1 


m 


e 


A SURVEY OF PSYCHOLOGY IN THE SOVIET UNION 251 
" in which they strove (1) to make the foundation 


stone of a Marxian psychology the principle of development, (2) to ap- 
proach the process of psychic development dialectically (66), that is, to 
stress the qualitative distinctiveness of its several stages, (3) to explain 
man’s psyche as a historical developmental product, and (4) to discover 
the explanatory principle behind the higher psychic processes such as 
logical memory, voluntary attention, thought, and speech. 

The originators of this theory devised original methods of experi- 
mental research on the development of these processes in children and 
established a series of special laws and facts of a most interesting kind 
(58, 64, 77). Examples of their work are well represented in Leont'ev's 
monograph, The Development of Memory (57) and Vygotskii’s Thought 
and Speech, published posthumously in 1934 (132, 134). 

However, the originators of this theory were criticized as having 
fallen, like Kornilov previously, into the same trap that "characteristi- 
cally” ensnares the “bourgeois” psychologist: namely, the indefensible 
eating one’s own particular psychology and a correspond- 
d system on the basis of isolated laws and scraps of data 
pecial researches. 
hus tried to show that "cultural development 
ption and assimilation of those behavioral 
yment as a means for carry- 
e proverbial "tying a 
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consists essentially in the ado 
modes based on 'sign'-utilization and emplo 
ing out this or that operation" (117, p- 16). Th 
knot so as not to forget” became in its own way 
each and every "cultural" operation. There was dra 
between "natural development" (e.g. the organic developmen 
memory) and “cultural development" (e.g the mastery of symbolic 
methods of recall). In addition to the above, the completely abstract 
and formal approach of this theory to the problem of the historical 
development of the psyche came under fire—an approach which was 
criticized as overlooking the concrete course of social development, the 
class struggle, etc Also censured was its overly abstract approach to 
child thought with its consequent neglect of his concrete social environ- 


ment, his concrete life and activity (91, pP- 338-339). 
T taken to task on yet another score. They 


Vygotskit ahd Luriia were 
were accused of exhibiting a failing in common with that of many other 
Soviet psychologists of that period—the uneritical borrowing of all 
manner of theories from the “bourgeois psychologies- "à practice 
which by 1929 was very much in evidence. This practice was, indeed, 
a fact not to be glossed over, however insist stinctions 
finely drawn between Soviet and "bourgeois 
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spirit. Blonskii,? for example, was a behaviorist of Watsonian deriva- 
tion, however much he opposed Soviet to American behaviorism, as 
when he remarked in 1925 that the latter "never would dare and hence 
could not give a correct explanation of American bourgeois behavior 
and disclose [for instance] the true essence and motives of American 
senatorial behavior” (117, p. 17). 

Also placed under especial attack was the wide-spread opinion, €X- 
pressed specifically by Vygotskii, that (1) it was necessary to distinguish 
in foreign psychological schools two aspects: (a) “the factual basis” an 
(b) "the theory constructed thereon" ; and (2) the denial of the second 
need not necessarily lead to rejection of the first. It was pointed out by 
critics that even among men of the greatest integrity “the factual basis" 
has its content in effect determined by the "theory." These critics 
maintained that the psychologist can always "extract" from out the 
manifold welter of reality precisely those psychological “facts” which 
his theory demands and, further, that even among individual ‘‘bour- 
geois” psychologists this “factual basis” has often been looked upon a$ 
erroneous and displaying more the essence of prejudicing interpretation 
by a given school than the essence of neutral fact (117). This criticism 
was especially damning in that psychological theories were and even 
now are considered as having philosophical bias and political leaning, 
so that "bourgeois" psychologies are ipso facto reactionary in essential 
character. 

General Observations 
veral am- 


ceptions, 
anism, 


During the second period there were, as we have seen, se 
bitious attempts at the construction of general theories, con 
and systems—often under the banner of an aggressive mech 
but all judged unsuccessful from the dialectical materialist point of 
view. At the same time a significant increase in the range of psycho- 
logical work took place, particularly in child, pedagogical, and indus- 
trial psychology. An independent journal, Psikhologiia, was establishe 
for psychology, many collections of experimental work were published, 
and a special section accorded it in the philosophical faculty of higher 


educational institutions. 
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"de i a generalizing character appeared; for example, 
; | Text of Psychology, Expounded from the Point of View 
of Dialectical Materialism (44), Vygotskii's Pedagogical Psycholo 
(130), Basov's Methods of Psychological Observation on Children (18)5, 
Blonskii's Psychological Essays (23)", and Dobrynin's Introduction i 
Psychology (29). 

. Despite the great preoccupation with dialectical materialist prin- 
ciples by Soviet psychologists, only one “materialist” proposition was 
subsequently considered as rightly taken—the psyche is a property of 
highly organized matter. But the recognition of the psyche as a property 
of the brain was in itself insufficient for the establishment of a Marxist 
psychology in as much as there was no real idea of what this property 
should consist. As will be later seen, Lenin’s ‘‘theory of reflection” was 
to provide the clue for the dialectical psychologist (55, 56). 

Meanwhile, toward the end of the second period, in 1929 and 1930 
the center of research at the Moscow Psychological Institute was 
focussed on the problems of social and collective psychology,” anda 
great deal of attention was thereto directed, not with much success, 
however, from the standpoint of subsequent criticism. Kornilov headed 
the section of general psychology, Rybnikov—child psychology, Borov- 
skit—zoópsychology (habit formation in animals), and Shpilrein— 
psychotechnic (‘‘professiography,” fatigue, industrial training; etc.) 


(28, 93). 

Toward the end of the 1920's enthusiasm for various "bourgeois" 
psychological schools had become strongly pronounced. Soviet psychol- 
ogists discerned affinities among these schools which made them seem 


“close” or “related” to Soviet psychology. Especial enthusiasm was 
' and, in particular, its 


reserved for German "holistic psychology,' 

Berlin branch, Gestalt psychology. However, criticism was unceasing, 

and it was driven home that the declarations of the Gestaltist school 
anism and the “alluring vitality” of its experimental 
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convictions had been blunted. For example, Borovskit in 1929 wrote: 


Some of our junior fellows are showing considerable interest lately in the 
German Gestalt psychology --- - Certain principles in it [seem to] conform 
to fundamental dialectical postulates. [But] the subjectivism and idealism, 
which are very prominent in Gestalt psychology, cannot... be approved 


of .... [As for American behaviorism], we shall never agree witha behaviorism 
trying to be purely empirical (for "no philosophy means bad philosophy"? 
or with a behaviorism tending to eclectics . . . . Behaviorism [needs rebuilding] 
on the philosophical basis of dialectical materialism (28, p. 184). 


These psychologists were chided for seeking '"'fellow-travelers" among 
bourgeois psychological schools instead of devoting their energies to- 
ward the construction of a truly Marxist psychology. 

Among the positive achievements of the second period the develop- 
ment of particular experimental methodologies must be included—that 
which even subsequent criticism concedes. To be singled out for special 
mention are Lurifa’s method for the study of emotional states (68, 69, 
70, 72, 76); the methods for the study of memory, thought, and speech, 
devised by Vygotskii’s students: Sakharov (99), Leont'ev (57), Zankov, 
and others; Basov’s observational method with preschool children (15, 


106); and Rybnikov's "biographical method” involving the use of 
diaries, autobiographies, etc. 


Crisis,  Housecleaning," and Reorientation 


Despite its accomplishments, however, toward the end of the second 
period psychology suffered great loss in prestige and sacrificed the greater 
part of ground previously gained. There were several reasons for this 
retrogression and the chief ones were perhaps bound up with psychology 
itself; for the dominance in it of certain mechanistic tendencies are nOW 
seen as having entailed two unfortunate consequences: one which led 
directly to the liquidation of psychology itself as a science (via reflexol- 
ogy) and the other which led indirectly to the same result by bringing 
about in psychology a state of general sterility (93). 

The second period in the development of psychology in the Soviet 
Union may be viewed as terminating in 1931, at which time there was a 
culmination in the sharp crescendo of discussions directed against the 
dominant mechanistic tendencies rooted in the then reigning psychol- 
ogies: reflexology, reactology, and related conceptions. The result was 
their demise; for even where reflexology and related views seemed most 
vigorously to do battle with the “idealistic” psychologies in behalf of a 
Marxist psychology, these were subsequently perceived as mere strug- 

gles between schools within the framework of traditional “bourgeois” 
psychology. 
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cannot be understood apart 


t in Soviet psychology 
leration within the larger 


from those developments undergoing acce 
framework of Soviet society itself. In a very real sense 1929 must, in 
fact, be regarded as the “year of the great divide” (113, p. 264) For 
psychology as for the rest of the Soviet Union, because in this year de- 
cisions were made by Stalin and the Communist Party which were to 
have long enduring effects throughout the whole of Soviet society. 
Prior to 1929 one might say that sundry “capitalistic” elements and 
influences had been more or less tolerated; subsequently they were to be 
studiously, and in many cases ruthlessly, weeded out—all this as the 
passionate accompaniment of the adoption of the famous five year plan 
calling for the establishment of heavy industry and the collectivization 


of farms in the Soviet Union. 

The struggle against “reactionary” influences was not long, there- 
fore, in finding expression on the philosophical "front" and led to a 
busy reinvestigation, under Marxist-Leninist principles, of basic and 
derived ideology, Asa matter of fact, throughout 1930 there took place 
among Soviet philosophers much discussion whose aim, at Stalin's 
suggestion, was two-fold: (1) the liquidation of "anti-Marxian" ideal- 
ism, on the one hand, and vulgar mechanism, on the other; and (2) the 
development and exploitation of the philosophical heritage bequeathed 


by Lenin (117). 

As on the philosophical front, so on the psychological front. From 

1930 to 1932 much discussion scale was in evidence. 
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regarded as critical for the formation of a dialectical psychology) ar ea 
were hammered out the most important of the propositions deemed essentia ag 
the establishment of a truly Marxist psychology, that is to say, a psychology 
securely founded on the principles of dialectical materialism, as first wed 
by Marx and reworked by Lenin. The history of dialectic psychology is one 
which criticism and samokritika have played no mean roles (117, 137). 

Two insistencies underlay all these discussions. Baldly stated, they 
were a call for (1) a heightened sense of “Bolshevik party-spirit™ 1 
Soviet psychology and (2) a Soviet psychology applicable to and com- 
mensurate with the urgent demands of “socialist construction.” Ac- 
cordingly, there arose a sharp reversal in the attitude of Soviet psychol- 
ogists to foreign psychological thought. Before, many were prone to 
look on foreign psychologies as somehow supplying the ‘‘last ue T. 
psychological thinking" or, at least, as worthy of serious study for Lo 
“factual bases" despite the untenability, from the Soviet point of view; 
of superimposed theories. Now it was declared evident that to follow 
the lead of any of the major schools of foreign psychology such as e 
haviorism, Gestalt psychology, etc., was to get one's self up a blinc 
alley from which there was no escaping (117). 

At the same time the point was driven home that, if psychology 
shunned the pressing problems of real life and applied practice, what- 
ever the claims may be that it might make for itself, it could not claim 
recognition as a science geared to Marxism-Leninism. Only in the 
course of solving those problems generated out of the very demands of 
everyday living and "socialist construction" was it possible, so it was 
averred, to establish in Soviet psychology a truly scientific theory. 

In the fundamental reorientation that took place in Soviet psychol- 
ogy, Lenin’s basic philosophy was decisive. Two works of Lenin's must 
be singled out as having exerted an especial influence. They are his 
Materialism and Empirio- Criticism (84) and his Philosophical Notebooks 
(85, 56), the latter first published posthumously in 1929-1930. Dis- 
cussions based on the material of these two books in particular led to 2 
reexamination of basic methodological problems in psychology and to 
the “conviction,” derived from the study of the latter work, that with- 
out thoroughgoing acceptance and mastery of the Leninist “theory of 
reflection” the construction of a Marxian psychology would be impos" 
sible (93, 117). 

The Leninist conception of the psyche and consciousness as the 
reflection of an objective world, it was believed, established at long last 
the means for "materialistically" substantiating the notion of psychol- 
ogy as the science of man’s psyche m aggressive opposition to the puni 
of psychology as the science of man's behavior, understood as an a£8 
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gate of outer reactions. Consequently, behaviorism in all its forms— 
reactology, reflexology, etc.—simply folded up under the vigor of the 
concerted attack to which it was subjected. 

Not only was behaviorism decried as leading to an inadequate 
psychology, but its “reactionary,” "political" character was held up 
ie excoriation. In behaviorism's “renunciation of consciousness” and 

approach to man as a reacting automaton" were discerned the not too 
subtle influence of capitalism; so that in 1931 Vedenov, summarizing in 
part the discussions on reactology, could declare: 

The whole reactionary character of [behaviorism's] approach to man is 
Behold the ideal of capitalism—man, the automaton, a 
one can compel as one wills! Behold the dream of capital- 
ists the world over—a working class without consciousness, without mentality, 
whose actions are supinely subject to the training whims of the exploiter! 
That is why it is in America, in that bulwark of contemporary capitalism, that 
this theory of the man-machine was so powerfully developed and so stubbornly 


maintained (117, p. 21). 

As a consequence, not only did consciousness and its directive and 
regulative roles in man’s behavior become questions of paramount 
urgency, but the very approach to the investigation of a majority of 


psychological problems was radically changed—an approach first 
hibited in researches on habit formation. From this 
began to emerge—a conception 
tially conscious. 


quite in the open. 
robot, whose action 


systematically ex 
time on, a new conception of habits 
which saw in habit formation in man a process essen 
The question, ‘‘What constitutes the role of consciousness in the process 
of habit-formation?'' came to pose the essential problem for the Soviet 
investigator in this area of psychological study (91, ch. 16). 

The psychological discussions, whose consequences we have been 
indicating, set the final seal on the fate of reflexology. Even prior to 


this time, things had not been going too well in the reflexological camp. 
Bekhterev had died in 1927 (107), and shortly thereafter, in the course 
of lively sessions marked by much criticism and self-criticism, a breach 


sundered the hitherto solid phalanx of his following. One group tread 
in Bekhterev's footsteps, espoused to a militant mechanism and a com- 
the other, having subjected the methodo- 


pounding of his "errors"; ha : : 
logical foundations of reflexology to critical analysis, sought a new di- 


rection. 

It was out of this latter group that there arose such active workers 
in behalf of a dialectical psychology as V. N. Miasishchev and B. G. 
Anan'ev. Anan'ev's paper, published in the journal, Psikhologitt, at 
the termination of the “reactological discussions and since pul 
a model of “Bolshevik self-criticism,” set forth so definitive a methodo- 
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logical critique of reflexology that to this day it is considered reflexol- 
ogy's fitting epitaph (117).'? 
THE DIALECTIC PERIOD 
General Observations (1931-1936) 


At the beginning of the third period in 1931 the prestige of psychol- 
ogy had, as already noted, fallen off to a very considerable degree—a 
state of affairs reflected by a drop in the number of psychological works 
published. However, it was felt that the liquidational tendencies of 
"vulgar mechanistic materialism” in relation to psychology were in 
principle, at least, overcome, and faith in the possibilities of a truly 
dialectical psychology sought affirmation in the wide world of practice. 
New centers for psychological work were established throughout the 
Soviet Union; e.g., in Georgia, in the Ukraine (Kharkov, Kiev, and 
Odessa), and in Leningrad (GertSen [Herzen] State Pedagogical Insti- 
tute and the Leningrad Brain Institute). Moreover, with the ambitious 
programs of industrial and educational expansion zealously undertaken 
by the Soviet Union, interest, already pronounced, in child, pedagogical, 
and industrial psychology continued steadily on the rise (93). 

Vygotskii’s theory of the cultural development of man’s higher 
psychic functions continued under discussion. The basic tendency of 
Vygotskii’s theory, whose ideas were often in close coincidence with 
those of the French sociological school (as represented by Blondel (20), 
for example) was to forward in psychology the idea of development and 
the principle of historicism. However, as previously pointed out, the 
theory was adjudged defective and interest in it gradually diminished. 

The challenge to comprehend the human psyche in the light of 
Lenin's theory of reflection pushed to the fore the problem of sensation 
and perception as constituting the initial step toward knowledge of the 
world. Previously, hardly any attention had been directed to this par- 
ticular problem in Soviet psychology. The reverse now became true. 
The sensations, especially the visual, were subject to intensive research. 
Out of this research there emerged, in the work of Kravkov (53) and 


13 Translations from the Russian into English are hard pressed to keep pace with 
the kaleidoscopic developments in the Soviet Union and unintentionally contribute to 
much erroneous misconception as to the past and present state of affairs in the Soviet 
psychological world. Thus, the reflexological and reactological articles by Shnirman 
(105) and Kornilov (45) respectively were already obsolete at their date of published 
translation in 1930, and the translators of Bekhterev's General Principles of Human 
Reflexology could make in 1932(!) the incredible statement that “reflexology is the 
dominant note in Russian psychology today, and the late Bekhterev is the guiding light 
of humanistic science in the U.S.S.R." (19, p. 15). 
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Lebedinskii among others, the theory of the "interaction of sensations” 
(8, pp. 259-260). This theory presented the thesis that by means of 
simultaneous excitation or induction from other sense organs the thresh- 
old of sensation can be lowered for a given sense organ (91, p. 156). 

In 1932 and 1936 there appeared the first and second editions of 
Kravkov’s book The Eye and Its Functions (52), lauded as one of the 
best generalizing pieces of literature on the psychophysiology of vision. 
However, not only were the theoretical aspects of visual sensation and 
perception pursued by researchers, but also their practical or applied 
aspects; for example, the work of Shvarts and Artemov on typography; 
work in the field of military camouflage, and so forth (117). 

There was also considerable research done on memory and thought 
processes. Besides the earlier indicated work of Vygotskii and Leont'ev, 
to be singled out for mention are the books of Blonskif, who had by now 
turned aside from pedology, an ‘‘anathematized" development of peda- 
gogical and child psychology shortly to be discussed. They are Memory 
and Thought (35) and The Development of Thought in the School Child 
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centrate their efforts on the study and generalization of experience and 
data gained by the "practical" workers in the schools. In other words, 
"bourgeois" pedagogical psychology was no longer to be slavishly aped. 
“Bougeois” pedagogical psychology was seen as too much the lackey of 
theoretical psychology; as too busied the applier of the latter’s conclu- 
sions, derived, not from the concrete pedagogical situation, but from 
the remote experimental laboratory. This relationship was now to be 
reversed. The psychologicallaws and properties of pedagogical situations 
were to be discovered and worked out in the very course of the peda- 
gogical process itself and not apart from it. An illustration of this new 
approach is Anan'ev's monograph, The Psychology of Pedagogical 
Evaluation (10), published in 1935. 


Pedology 


During the second half of the 1920's a special discipline, pedology 
(32), got under way, having gathered momentum with the publication 
in 1925 of Blonskii’s Pedology (24). This book played an important role 
in the development of this special discipline. In it pedology is no longer 
looked on as mere child psychology or as experimental pedagogy» but 
as a genetic science dedicated to the study of the growth of the child.“ 

Interest in pedology mushroomed and by January, 1928 the First 
Russian Congress of Pedologists had been held. Many prominent 
psychologists became interested in this new field, from among whom 
Blonskii, Basov, and Vygotskii are to be singled out. 

The psychological discussions of 1930-1931, which today are seen 
as heralding Soviet psychology as a single science whose various sub- 
divisions are unified by their common methodology and common prob- 
lems, strangely enough did not materially hamper the growth and spread 
of pedology. Despite increasingly unfriendly comment, it continued to 
flourish and managed somehow to bypass the fate accorded the other 
alledgedly mechanistic psychologies, that is, until 1936 when it was 
singled out for official censure in a special decree (84b). 

i Pedology came to be essentially an adaptation and development of 
ideas drawn from western educational psychology with its familiar 
paraphernalia of measuring techniques, tests, and so forth, and has been 
described, especially since 1936, as a “vicious pseudo-scientific bour- 
geois importation" (32). This special discipline is still regarded as hav- 
ing wrought incalculable harm to the Soviet school system, and the 

14 Lurifa in 1928 referred to pedology as “the science of the growing organism and its 


aapon complexes (including the somatic, psychological, and social peculiarities of 
the child)” (65, PP- 347-348). 
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work of the aforementioned psychologists is represented as precious 
years lost to worthwhile psychological research." 

It must not, however, be concluded that pedology fell into any sud- 
den disfavor and that because of official ukase. Throughout the years 
of its growth and spread in the Soviet Union pedology had been subject 
to unceasing attack by many psychologists particularly for its basis in 
f testology” which is in itself viewed as a pseudo-science. In books and 
in the journal, Psikhologiza, there are to be found not a little criticism 
of test-methodology. Kornilov, for example, in 1928 wrote that “in 
absence of solid preliminary substantiation the method of tests de- 
ctivity for which scientific character is claimed, but 
which is really] a game with jackstraws. ..." (117, p. 19). 

The 1936 decree on pedological distortions in the Soviet school 
system accused pedology of having monstrously perverted the educa- 
tional process through its regarding the development of the child and 
the growth of the personality as basically processes fatalistically pre- 
determined by heredity and the elemental action of a depersonalized 
environment. The decree, furthermore, discerned a “reactionary” 
character in the principal laws of pedology, since (1) these "laws" un- 
justly presumed the fatalistic dependence of a child’s destiny on bio- 
logic and social factors, that is to say, on the influence of a blind heredity 
and some sort of ‘irrevocable environment” and (2) these "laws" were 
o be in “howling contradiction to Marxism and to the whole 
alist construction, which was [seen as] successfully 
le in the spirit of socialism and rooting out the rem- 
nants of capitalism in the economy and consciousness of men” (84b). 

In addition, pedology’s chief method of investigation—that of 
“testology’’—was pronounced a “downright mockery and insult against 
school children” and "in opposition to both the real aims of the Soviet 
l and healthy good sense” (84b). Therefore, although the aim of 
pedology was ostensibly a thorough and many-sided investigation of 
the child, it was accused of having, in practice, degenerated into in- 
vestigation” of children by “standardized” tests. The position was 
vehemently maintained that such tests may outwardly lay claim to 
scientific character through their pretense of ability to "measure" 

ereditary potentialities, etc., but 
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science and, in principle, differ little from graphology and chiromancy. 
This view is still the current one and is strongly upheld (91, pp. 34-35). 

Pedology, then, despite Marxist trappings, was denounced as anti- 
Marxist and evilly akin to American ''testology"" which is viewed as 
serving, to all intents and purposes, the reactionary interests of capital- 
ism in that ‘‘tests happen to be an exceptionally convenient weapon 
with whose assistance it is possible, with seeemingly complete objectiv- 
ity and impartial operation with numbers, to bring about the desired 
selection of workers, to 'demonstrate' the psychic inferiority of the 
unemployed, to confirm the special talented nature of the children of 
the exploiting classes, and so forth” (117, p. 6). 

For similar reasons ''psychotechnic" was denounced. Initially psy- 
chotechnic was designed to perform the functions of all manner of ap- 
plied psychology. Subsequently, by psychotechnic was understood 
particularly that branch of applied psychology which was interested 
mainly in problems of rationalization of work and vocational selection. 
It, too, made great use of standardized tests and suffered the same fate 
as pedology at about the same time. 


DIALECTIC PSYCHOLOGY 
General Aspects 


The liquidation of the ‘‘pseudo-sciences’”’ of pedology and psycho- 
technic served to close the gap between theoretical and applied psychol- 
ogy and facilitated the spread of psychology into fields from which it 
had previously in large measure been debarred for the reason that these 
fields had been preémpted by the pedologist and the psychotechnician. 
The fact to note is that out of the pedological crisis and due to the in- 
creasing progress of psychology toward the goals set for and by the 
Soviet psychologist, there came to eventuate with the years a flourish- 
ing cohesive psychological system which has successfully weathered the 
exacting years up to the present. Furthermore, on the basis of available 
evidence, this system seems to have been quite fruitful in both theo- 
retical and experimental investigations as well as in practical applica- 
tions. The year 1936 may then be regarded as a real turning point in 
the development of a unified dialectical psychology; one which psychol- 
ogists in the U.S.S.R. could call a Soviet psychology. 

However, even before 1936 a definitely dialectical psychology was 
already noticeably well along the road to satisfactory formulation. 
Razran, for example, was able to report in 1935 the following: 

Until a very few years ago, dialectical psychology was largely theoretical 
and philosophical. It debated body-mind problems, sought theses and anti- 
theses, quoted Marx, Engels, and Lenin. In practice, it borrowed heavily from 
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A merican and German psychology, often little critical whether the borrowed 
techniques, methods, and even interpretations were in line with its own 
dei ne arp oes and social ena a This has changed now. Dialec- 
with a emeh aee Rl isea pie S apka hake 
Its main tenets are materialism, dialectics Feyetietugionl levele, = pee 
e r ; + PS} s, and above all 
an emphasis upon the social rather than the individual aspect of behavior. In- 
deed, consciousness itself, although admittedly biological in origin, is held 
to be in reality always social in content and the heuristic value of the conditioned 
response is not in interpreting social behavior biologically, but in modifying bi- 
ological behavior socially (86, pp. 23-24). 
4 Razran's observation, however, that dialectic psychology places 
above all an emphasis upon the social rather than the individual as- 
pect of behavior," risks misconception as to the current position, if it is 
taken to mean the opposition of the social to the personal and the lat- 
ter's minimization in the psychological scheme. Instead there is in 
dialectic psychology an insistent emphasis on the “unity of the social 
and the personal." Therefore, some amplification is in order. 

Razran's observation is at present applicable only if it is understood 
as meaning that the study of individual behavior cannot be undertaken 
in an ahistorical sociological vacuum, since the Soviet psychologist re- 
gards as fundamental the thesis that man, being a member of and in 
society, cannot be studied apart from it. And this belief is reflected in 
und distrust for approaches that sacrifice the individual at the 
altar of broad psychological generalizations which, in the very process 
of their initial conception and subsequent theoretical formulation, have 
ignored the concrete activities of concrete individuals in concrete situa- 
tions (62, 63, 97). In other words, Soviet psychology claims to be an 
individualized psychology, but the individualization is that of a social 
being and not of some abstract entity existing in a personal world 


“locked off” unto itself. h 
The stress on individuality partly accounts for the hostility of the 
“indiscriminate orgy of standardized 


Soviet psychologist toward that : 
tests" which drowns out the individual in a sea of ‘impersonal statis- 


tical averages” (93). Pedology and current American practice, as the 
Soviet psychologist views the latter, are particularly singled out as per- 
nicious examples of a profligate “testology” with its "fatalistic philos- 
ophy of static norms ignorant of and detached from the dynamics of 


mental development" (85). 
above, that what passed for dialectical 


azran has ointed out iat a 
E * the third decade, did in fact borrow heavily from 


his profo 


It is true, as R ; 
psychology, especially prior t 


iet psychology bes a social genesis to conscious- 
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16 Contemporary Sov: however, 
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the "bourgeois" psychologies. Gestaltism, for example, was very well re- 
ceived by Kornilov, who, duly recording his good-will in Psychologies of 1930 
(45), cited Gestalt psychology as ‘‘thoroughly dialectic” in its exemplification of 
the Marxian “law of mutual dependence of quality and quantity” (45, p. 255). 

Certainly, Gestaltism may seem, at least outwardly, to exhibit kinship with 
dialectical materialism. Gestaltism holds that out of the union of combinatorial 
parts new qualitites or wholes may emerge. Similarly, dialectical materialism 
maintains that mere quantitative accretion on one qualitative level can lead 
emergently to a discontinuous “leap” or skach6k to another level, qualitatively 
different from that which furnished the springboard for the saltation into 
novelty (111). 

Even Freudianism has been made to exhibit kinship with dialectical ma- 
terialism (87; 91, p. 67; 124). Freudianism makes much of the existence of 
opposing dynamic psychological mechanisms, their struggle, and tentative 
resolution on a new plane. Dialectical materialism likewise affirms “the unity 
of opposites” which, through mutual conflict, evokes the emergence of new 
characteristics (67). 

Yet the kinship suggested has been adjudged in either case to be at best 
superficial and rejected (38, 87, 100). Freudianism is decried as a particularly 
vicious theory serving as an “idealistic” vehicle for a reactionary decadent 1r- 
rationalism? (124). And Gestaltism is criticized for being not only idealistic in 
its theoretical conception, but also mechanistic in its operation (35; 91, pp- 58- 


59). 
Of the developing dialectical psychology Razran writes: 


The Soviet psychologist's dialectical materialism commits him to the recog- 
nition of dynamic opposing tendencies in psychological phenomena, and— 
what is even more important—to the emergence, through the conflict of these 
tendencies, of new qualities, emergents, or higher psychological levels. These 
new emergent levels of behavior—conscious, verbal, intellectual—are iiem 
ally supra-summative and demand new specific methods for their approac 
and study so that both structuralistic introspectionism and objective atomism 
are necessarily abnegated .... 

[Furthermore], in dialectical psychology emergence and function 
summativeness do not presuppose metaphysical configurational correlati 
are held to be merely results of the interaction of parts. The system 1$ thus 
more similar to the “reaction stages" and “higher units" of Woodworth than 
to those of Gestalt and of Wheeler and Tolman” (86, p. 20). 


The above observations appeared in 1935. In 1942 Razran sug- 


al supra- 
lates but 


17 Although in 1924 it was possible to have several papers presented by psychoan- 
alysts before the Second All-Russian Psychoneurological Congress, held in Leningrad: 
by 1930 the psychoanalytic movement in the U.S.S.R. came to a practical end and the 
publication of psychoanalytic material, which had already experienced increasing 
* difficulty for a number of years, ceased in toto. To this day the hostility to Freudianism 
and related conceptions continues unremitting. Freudianism is now either ignored or, 
if noticed, only to be criticized as an “idealistic, metaphysical, and reactionary science, 

isplaying] the methodological faults of biologism, psychologism, and individualism 
Pe] inn three defects characterizing its application to major aspects of man and 


society. 
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gested that, all in all, “current Soviet psychology may probably be best 
designated as a sort of Neo-Functionalism," adding, however, that he 
thought Soviet psychologists would “no doubt resent such an apella- 
tion” (85, p. 446). As a matter of fact, they apparently would, for the 
very reason that “the overcoming of abstract functionalism” (91, p. 3) 
is conceived as one of the major tasks of dialectic psychology. For in- 
stance, in 1943 Rubinshtein wrote the following:'8 
. Three problems seem to be possessed of an immediacy for psychology at 
this given stage, and their correct formulation, if not their solution, is one that 
is especially essential for progressive thinking in psychology. 
1. The problem of the development of the psyche involving, in particular, 
(a) the overcoming of fatalistic views regarding the development of personality 
and consciousness; and (b) the problem of development and learning [the prob- 


lem of development and education]. 

2. The problem of actional efficacy and consciousness; in other words, the 
surmounting of that passive contemplation which reigns in the traditional 
psychology of consciousness. And in connection with this, 

3. The elimination from psychology of abstract functionalism [which 
ms of activity by externally applying abstract 
{ the conditions in which activity takes 
to the study of the psyche and con- 
d consciousness are not 


approaches psychological proble 
principles without consideration o 
place"]; and the transition of psychology 
sciousness in concrete activity, in which the psyche an 
only made manifest, but also formed. 


The shift from the study of functions, 
study of the psyche and consciousness in concrete activity [promises to] enable 


psychology organically to draw near to the concrete questions of applied 


only abstractly conceived, to the 


practice . . . (93). 

In view of the foregoing and the basic principles of dialectic psy- 
chology, shortly to be enumerated and briefly discussed, it seems to the 
writer best to regard dialectic psychology as a school of psychology 
in many respects decisively different from others, instead of hazarding 
such an ambiguous designation as “neo-functionalism,” however useful 
it may be for suggestive comparisons or hints to assist those of us who 

18 In untranslated manuscript of Bases of General Psychology, intended for an Ameri- 
can edition. The first edition of this important work appeared in 1940 and was awarded 
in 1942 the Stalin Prize. The writer wishes to acknowledge his indebtedness to Helen 
Black of the Preslit Literary Agency for the opportunity of studying this manuscript. 

A second edition of this work (96) appeared in 1946 and was the subject of special 
di ion in 1947 during a psychological conference. While Kornilov concedes the book 

UN ntal" and “first-rate,” he reports that among the criticisms leveled at it 
EN paa aes astery of the dialectical materialist method, (2) conciliatory ap- 
ere Gee yi hologists and their theories, and (3) failure to incorporate a 
peasement of bourgeois psyc 117, p. 31). Leont'ev (61), 


bang ! 13: 
f the Soviet citizen (41; 47, p. 13; 

md ph E y "tasks" that await Soviet psychology as a result of the 

in the course 


t tics "debate" from which Lysenko emerged complete victor, also alludes to 
recent gene a 


B 2 í k 
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certain "'objectivist" deficiencies 1n 
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are anxious to form some idea of matters where there is so little informa- 


tion available in English. Razran rightly states that “the Soviet psy- 
chologist is really not just a laboratory experimentalist, a fact-finder 
who finds facts to be used somewhere, by some one, some time; [because 
he] is concerned . . . with the change and control of behavior in a par- 
ticular direction,!? [whereas his] American [colleague] is interested in the 
prediction and control of behavior without emphasis on the direction 
of the control" (86, pp. 19-20). Such a statement in itself is enough to 
signalize the presence of fundamental differences. 

It is important to realize that not only is dialectic psychology 4 
Soviet psychology, but it is the Soviet psychology. There is only one 
school of psychology in the U.S.S.R., and its creation, though admittedly 
far from complete, is regarded as already an outstanding accomplish- 


ment of great significance. 

Thus, we find, for example, 
referring to "our Soviet psychology" 
of the greatest general achievements o 
the establishment of theoretical unity within it at the same time that 
there thrives a tremendous diversity in its creative research . . ." [and 
that], “with full awareness of the historic role of Soviet psychology, it 
is necessary to note the fact that Soviet psychology is the first in the 
whole history of psychology throughout the world to have attained 
theoretical unity, a theoretical wholeness of psychological theory for- 
mulated on the basis of Marxist-Leninist philosophy." [Asa matter of 
fact], "there exists not a single field of psychology, in which there has 
not been made felt the great transmuting force of the philosophical 


Marxism-Leninism” (6, p. 56). And Teplov, summarizing 
remarks that, 


Rubinshtein (91) and Kornilov (46) 
and Anan'ev asserting that “one 
f Soviet psychology consists 1n 


principles of 
the achievements of thirty years of Soviet psychology 5 
“guided by the only true methodology—that of dialectical materialism, 
Soviet psychology opens an entirely new chapter in the history of psy- 

apters 


chological science; a chapter in relation to which all previous ch 
must appear as only prehistory” (117, p. 5). 


19 The Soviet psychologist affirms not only the unity, within consciousness, of the 
personal, but also, among others, the unity of (1) the ideological and the 
2) the moral and the psychological, and (3) the Weltanschaulich and the 
He asserts, furthermore, that no mischievous “objectivism” must blind 
hologist to the necessary “unity of psychology and political philosophy 


social and the 
psychological, ( 
psychological. 
the Soviet psyc 
(9, 94). 

In a recen 
parently exhibit 


will be ina Soviet society. I r 
mirrored and projected. 


beyond static analyses and 


man's direction is trans- 


tarticle by Rives (88), the fundamental interest in 
but what he should and 


ed in his declarative "study" of not what man is, r 
n other words, in psychology, not man but the “new Soviet 
It follows, then, that a “complete” psychology 
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A SURVEY OF PSYCHOLOGY IN THE SOVIET UNION 267 
The Basic Principles 


1. The principle of psychophysical unity. This principle exhibits the 
materialist monism of dialectical materialism and involves basically 
Lenin's theory of reflection. It postulates the unity of the psychic with 
the internal and the external; the psyche being considered as both a 
function of an internal organic substratum and a reflector of things in an 
external world existing prior to and apart from consciousness. Thus, the 
psychic properties of the brain—"'the highest form of organic matter"— 
are scen to effect the subjective reflection of a real outer world. 

'This principle, however, is not some pious formula that is to be 
dutifully invoked and then forgotten as is so often the case elsewhere. 
Shunning both parallelistic and interactional dualisms, the Soviet psy- 
chologist takes the principle, as he sees it, of psychophysical monism 
very seriously. Thus, for the Soviet psychologist in his investigation of 


sensations: 


The sensations constit 
in their functional dependence on th 


ute the object of study, being considered not merely 
e organ, but also in their relation to the 
object, whose qualities they reflect. In distinction to the purely gnoseological 
treatment of sensations, which is concerned only with its adequacy or inade- 
quacy to the object, [dialectic] psychology studies sensations as determined by 
the total life and personality of [an] individual [in action]. Thus, [dialectic] 
psychological research is concerned not only with the "stimulus" but with the 
object, too; not only with the organ but with the man (94, p. 186). 


2. The principle of the developmental unity of the psyche and the organ- 
ism, or the dialectic principle. This principle exhibits the famous dialec- 
tical triad of thesis, antithesis, and synthesis. Implied are the two 
unities: the unity of opposites in mutual interconnection and penetra- 
tion, and the unity of quantity and quality; the latter involving the 
saltatory skachók or emergent “leap” in the process of development (40). 
This principle has, therefore, been also referred to as the principle of 
evolution in its dialectical materialist form (94). . 

The dialectic principle is reciprocal in the sense that it regards the 
development of the psyche and that of the organism and its mode of 
existence as reciprocally dependent on each other. Phylogenetically, the 
development of the psyche is viewed asa by-product in the evolution of 
organisms, that is to say, as a derivative, though specific, component in 
this evolution. Thus, f evolution adaptive changes in the 
form of life and mode of existence are thought to induce modifications 
not only in the structure but also in the psychophysical functions of the 

= Sy; : but the two of them—structure and function—are con- 
nervo a sere oexistent in “complex unity, mutual interaction, and 
ceived always as Coe 
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as a member in and of society. This is taken as liberating man from the 
notion that he is fatalistically predetermined by some all-powerful in- 
eluctable heredity and depersonalized environment that somehow exert 
immutable effects in fixed sequence; for, according to this principle, the 
psychic properties of man are in action (education, work, social change, 
etc.) not only exhibited, but beaten into shape and transformed (62, 63). 

Neuropsychic activity is, therefore, held to be a reciprocal function 
of the organism's form of life. This leads to the statement of two neces- 
sary corollaries: (1) The psychic development of animals is determined 
primarily by biological laws, since animal forms of life are essentially 
biological; but (2), in contradistinction, man's form of life is not to be 
equated to that of the animal. It is social, historical, man-created, 
hence, on a qualitatively different plane; so that the origin and develop- 
ment of human consciousness, itself, is not sought in mere biological 
genesis, but is instead viewed as being, in the last analysis, determined 
by the organization of the ‘‘means of production," historically evolved 
and evolving in society. 

3. The principle of historicism. This principle reflects the historicity 
fundamental to dialectical and historical materialism. It is applied to 
the notion of the genesis of human consciousness in the historical process 
of man's social development. This, too, is a reciprocal conception. 
Man's social existence and organization, summarizing the long period of 
his historical evolution, has come to determine his consciousness, his 
behavior, the form of his thoughts and feelings. Conversely, his con- 
sciousness, thoughts, feelings, etc. have acted to condition his socia 


existence and organization. 

This implies that the psychology of man must in 
not only his biology, but also his sociology and, further, that he must De 
regarded concurrently as the end-product of long years of prior histori- 
cal development and so understood. A psychology that precludes es- 
sential interest in his social and historical development is considered as 
grossly inadequate for man’s proper understanding, for the reason that 
man’s social and historical development is looked on as making him 
subject to psychological laws qualitatively different from those charac- 
terizing organisms not thus influenced. It is felt that man’s psychology, 
though genetically based on that of his animal predecessors, is not, pen 
ever, for that reason reducible to it; for the very fact of man's social an 
historical development is thought to have brought a genuinely new a 
tor into being and established thereby a real discontinuity between ae 
laws governing animal behavior and those governing the specifically 


human.”° 
20 Thus, i 


chology in i ps 
chological orientation, 
fae of Lenin's theory of reflection and th 


sonality (4). 


volve into its theory 


n 1939 Anan'ev reported that, while the work of the Department of Psy- 
e Bekhterev Institute for Brain Research was defined by the latter's neuro- 
it aimed, nevertheless, to develop its research problems on the 
e historical-materialist conception of per- 


" 


A SURVEY OF PSYCHOLOGY IN THE SOVIET UNION 269 


It is held, further, that even among men, distinctly different laws of 
the developing psyche must obtain depending on the uniqueness of the 
social organization which includes these men as active members. Thus, 
the ‘‘class-historical” approach leads to the view that the psychology of 
the Soviet citizen, for example, and his laws of psychic development are 
essentially at variance with those of other cultures and that the study 
of these unique laws must form a special integrated chapter in any psy- 
ip" that claims to be a complete and truly Soviet psychology?! (60, 
4. The principle of the unity of theory and practice? (stated otherwise, 
the methodological principle expressing the unity of research and in- 
fluence; or, alternately rendered, the principle of research and applied 
action). This principle asserts the dialectical materialist unity of theory 
and practice and affirms the Marxian thesis that objectivity of knowl- 
edge does not imply passive contemplation on the part of some “neutral” 
investigator, for the reason that true knowledge about the world is 
gained not by passive inquiry and speculating about it, but by active 
intervention and changing it (109). 


The principle of the unity of theory and practice characterizes, there- 
y which consciously influences, as it 


fore, an experimental methodolog S 
s reflects the more general reciprocal 


investigates, the human psyche. Thi 
thesis that man, in the course of altering the world, thus can get all the 
more deeply to know it; and, in the course of more deeply knowing it, 
acquires thereby the greater chance the better to change it. 

The human psyche, which is to be investigated under the guidance 
of this principle, is not contemplated as a static entity bound by psy- 
chological laws to exhibit foreordained properties, levels of develop- 
ment, or fixed behavioral patterns because of past biological and en- 
vironmental factors (62, 63). The human psyche is regarded not as 
something that can, at the most, merely fulfill limited potentialities set 
in advance by a constricting inheritance, but as something which can 

21 Concrete manifestation of this view is in abundant evidence in the current glori” 

i “new Soviet man” (88). 
— dE is reminiscent of Dewey's pragmatism and Lewin's “action research,” 
though, no doubt, decided exception would be taken to any assertion of affinity = 
example, Smirnov denounces Dewey's ideas as those of a demogogue = most evi 
enemy of true science and democracy" (104, p. 120) and Se eee ese, re 
moderately, to be sure, against the a E paan idealism” of Lewin’s topologica 


ations (62; 91, pp- , 
om unu d article (94) this unity has been somewhat inadequately re- 
ES acis a ‘Gnvestigation and action (vozdetstvie),” the parentheticated original 
= Ls v aie translator was not quite sure as to how best convey the exact “action” 
ed ad inus the prefix voz, means generally "action"; vozdefstvie, however, 
connoted. Detstvie, e^ intent to influen ce." Moreover, the methodological. Ming 
connotes "action Wl d cited in the same article, makes the kind of "'action' intended 
drawn from pee hildre teach the children, studying them 
e chi 


very clear— "Study t 
(94, p. 184). 


n, teaching them; 
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create its own prospects and perspectives for future development 
through man's own efforts and activities, collective and individual, 
within his society; heredity and environment being viewed as only fac- 
tors which condition, but do not dictate, the character, never quite 
final, of the psyche which an individual may come to display. The 
words, the human psyche is in action not only made manifest, but also 
formed, is a refrain that with variations is sounded over and over again 
in many different contexts throughout the whole of not only contempo- 
rary Soviet psychology but also other fields of Soviet life (97). 

. 5. The principle of the unity of consciousness and activity.” This 
principle reflects the Marxian thesis of the uniqueness of man’s con- 
sciousness, the inception and development of which is held to be essen- 
tially the historical product of labor by man within the social frame- 
work. It is, in fact, the grand principle into which the four preceding 
principles converge. 

In this principle consciousness is conceived as formed through con- 
crete activity among men in society, and knowledge of consciousness 
can be gauged through the activities of people; that is to say, in this 
very activity consciousness is not only formed, but made accessible to 
public disclosure and study. Consciousness, therefore, is not the ulti- 
mately solipsismal concept that it is for so many, but is an indissoluble 
unity of the subjective and the objective. In other words, consciousness 
for the Soviet psychologist, does not exist in a self-contained worl 
locked off unto itself, but exists and is formed in unity and relation with 
the outer world which it comes to reflect both experientially an 


nitionally through man’s activity (95). 

Furthermore, every experience of a person, he feels, 
inescapably an experience of something, so that the very intern 
of experience reflects a mediated determination via that person's inc 
vidually grasped or ungrasped relation to the external world of which he 
isa part. Consciousness is, therefore, looked on as a unity of the direct 
experientially given and knowledge, the latter not necessarily in any 
way formalized or verbalized (91, p. 57). 

Knowledge consists in actively grasped relations 
y be then characterized as partaking of 
sciousness; in other words, it may be termed as conscio 
that which is ungrasped may be termed as unconscious, the ! 
in this definite sense. For example, an attitude may exist 1n 


d cog- 


is always and 
al nature 


hips. That which is 
the essence of con- 
us. Similarly; 
the latter only 
an indi- 


grasped ma 


r-emphasize the role of purposeful action and activity !? 


For example, perception is held to be generated out of ac- 

i 7 hile the stimulus may precede the perception, 
ivi to- and phylogenetically While h r i 
Sm a derivative of perception and indicates a high degree of conscious differenti- 
distinguish sensations is not a primitive but already a highly 
an (91, pp- 151-152). 
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vidual and not be known to exist by him, that is, he may not grasp (be 
aware of) his relationship to others and the surrounding world, although 
it may be very apparent to outside observers. In this contingency we 
may regard his attitude as one that is unconscious, that is to say, un- 
grasped by him, though effective in his actions. But an idea, on the 
other hand, cannot exist in an individual ungrasped, without semantical 
violence, and, hence, the notion of unconscious ideas existing at odds 
with conscious man and irrationally exerting power and direction within 
an individual is held to be untenable (2, 93). 

All this makes also untenable the concept of unconscious wishes, 
kept essentially intact, through the long years in some mysterious sub- 
terranean cavern from which to sally forth in one guise or other at ap- 
pointed occasions to grip and bedevil man; unconscious wishes which 
subvert and minimize the relevance of man's directed conscious be- 
havior and activity. For the Soviet psychologist, there is no Uncon- 
scious, although some aspects of behavior and action are unconscious. 
In other words, the adjective does not necessitate the substantive, not 


even for Freudian exigency. ew 

The concept of consciousness as d unily of the experiential given and 
knowledge, gained in action and activity, is one of great moment and 
consequence in Soviet dialectic psychology. In man the component of 
knowledge in consciousness is so vastly greater and more organized than 
could be possibly the case in even his nearest animal relative as to make 
distinguishable a qualitative difference between the psyche of man and 
that of the next highest on the phylogenetic ladder. Consciousness is, 
therefore, a peculiarly and specifically human attribute. 

Of even greater moment and consequence is the concept of the unity 
of consciousness and activity, which is, in fact, regarded as the keystone 
of Soviet psychology. The connection between consciousness and ac- 
tivity is by no means a mere formal device, but is regarded as a deeply 
intrinsic and reciprocally genetic relationship, whereby the one condi- 
tions the advent and development of the other. In other words, through 
the organization and fashioning of concrete activity it is held possible E 
influence the formation and essential structure of consciousness, itself; 


just as consciousness is held to influence the onset and subsequent un- 


folding of the directed activity that is peculiarly man's. Gon onk ac- 

tivity is, therefore, seen as the sine qua non of man's psy ce ogy. 
Accordingly, consciousness and activity are viewed as eing in wen 

most intimately and reciprocally related, and the one cannot be un as 

from the other. For example, just as consciousness is relate 

pes hy aoe so is external behavior related to consciousness; 

o exte , 

as the analysis 0 


f behavior reveals that the outer aspect of action alone 
does not define it uni 


vocally (97). In other words, whereas, on the one 

e given external movements in one instance mey signify 

us st vea essentially different behavior, so, on the other hand, 
in other ins J 
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given dissimilar external movements at different instances may denote 
essentially the same behavior. 

Only by ascertaining the relation of man, as a conscious being, to his 
fellow-man and to the world about him may the inner content or mean- 
ing of his behavior be gauged. This relationship both constitutes and 
defines the significance of man's behavior. Stated more generally, the 
very meaning or content of man's conscious experience and behavior 
rests on the unity of consciousness and activity. 

For these reasons psychology must be understood, according to the 
Soviet psychologist, as not only the science of the psyche in developmental 
action,® but as one whose fundamental unit is the conscious concrete 
act—not an act along which consciousness meekly tags in psychophysi- 
cal parallelism, but one that is ontologically different from a non-con- 
scious act or mere mechanical reaction to stimuli. J 

In this manner the traditional Cartesian dualism of mind vs. body is 
held to be not only circumvented, but overcome. There is, therefore, no 
necessity to establish the relationship of behavior, as something ex- 
ternal, to consciousness as something that has only internal relevance. 
Behavior is already in itself, for the Soviet psychologist, a unity of the 
outer and the inner. Likewise each internal psychological process is a 
unity of the internal and the external, which in the definitiveness of its 
content, reflects an internally or, rather, psychologically refracted outer 
world of sense-infused meaningful objects. Instead of trying to correlate 
or link an “inactive” consciousness to an "unconscious" activity, the 
Soviet psychologist unites instead an “‘active’’ consciousness with a 
"conscious" activity. 

The basic orientation of the Soviet psychologist, as expressed in x 
above principles (9, 93, 94, 119), force him to go beyond the bounds o ia 
purely functional treatment of the psyche which regards each psychic 


process as univocally determined from within functional-organic de- 
utually 


cies. They lead him to view each psychic process as m 
relations set up between man and the 


f historically developed social activity. 
o the unity of research and influence 
in his experimental programs. For the Soviet psychologist the psyche : 
thereby thrust out of its traditional residence in some Cer qam 
inner world. The study of the psyche is, therefore, Pon in : 
text of the very concrete situations involving the life an activity 
of oed people who, as they manifest their psyche, form and structure it. 
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STATISTICAL BIAS IN COMPARISONS OF MONAURAL 
AND BINAURAL THRESHOLDS: BINAURAL SUM- 
MATION OR BINAURAL SUPPLEMENTATION* 


MONCRIEFF H. SMITH AND J. C. R. LICKLIDER 
Harvard. University 


In the last decade a number of papers! have dealt with the problem 
of binaural summation at the absolute threshold of hearing: to what 
extent does central summation of excitation from the two cars make 
binaural hearing more sensitive than monaural hearing? Since the 
answer to the question is destined to be ambiguous if the two cars are 
not equally sensitive, and since few people have matched ears, the pro- 
cedure has been (1) to measure the two monaural thresholds, (2) to 
handicap the better ear by an amount equal to the difference between 
the two monaural measures, and (3) with the ears thus equated to de- 
termine the binaural threshold. The difference between the equated 
monaural thresholds and the binaural threshold, usually about 3 deci- 
bels, has been taken as a measure of binaural summation. 


cedure was first described by Urbantschitsch (12), who used an car 
plug to handicap the better ear. In the more recent experiments,! at- 
tenuators have been used to adju 


st the currents fed separately to two 
earphones. 


This pro- 


This procedure is analogous, 


I except for the adjustment of the sti- 
muli to the sensitivities of the tw 


á 9 receptors, to the one used in compar- 
ing monocular with binocular vision; and Pirenne (9) and Bárány 


(1, 2) have pointed out that, in the visual case, the procedure by its very 
nature assures that two eyes will see more clearly than one. 
Let us assume that the visual ac 


ate independently of one another . i antaneous thresholds 
for monocular vision have the same distributi 


one eye alone has the chance a of seeing the symbol, both eyes together have 
the chance 2a =a. As a is smaller than 1, this expression will always be greater 
than a—that is to say, two eyes will be able to see better than 
result of random combination. [Barany (2), p. 127] 
The same source of bias is of course i 
gested, in the procedure as applied to t 
other sense modalities. Tt is of interes 


one solely as a 


nherent, as Bárány (1) has sug- 
he determination of thresholds in 
t, therefore, to inquire how great 


* This report was prepared under contract NSori-76 between Harvard University and 
the Office of Naval Research, U. S. Navy (Project Nr 147-201, Report PNR-72). Repro- 
duction for any purpose of the U. S. Government is permitted. 


! Studies in binaural summation were reviewed in 1948 by I. J. Hirsh (6). 
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the bias is in the monaural-binaural comparison and whether it is in- 
fluenced by the step of equating the monaural thresholds before de- 
termining the binaural threshold. 

In order to estimate the magnitude of the bias, it is necessary to 
define the null condition under which we should say that there is no 


binaural summation. We can imagine, for this purpose, two monaural 
ght ear, the other with only a left ear. The 


atsoever of communicating with each 
e experimenter. To obtain measures 
aural and “binaural” sensitivity, the experimenter tests the two 
then together, in the latter instance 
recording a positive response whenever either listener reports hearing 
the stimulus tone. It need hardly be said that we do not propose this 
schema as a psychophysiological hypothesis. There is of course no 
doubt that the two ears of a single listener do send their messages to a 


rin which true summation may occur. Evidence of inter- 
in psychophysical experiments on sound 


d in electrophysiological observations 


at the levels of the cochlear nucleus and the temporal lobe. The schema 
is therefore a "null hypothesis" which a given set of data should lead us 
to reject before we say that the data constitute evidence of true binaural 
summation. 


As a first step in estimating the magnitude 
ion effect), let us consider a variation of the procedure in 


1 monaural thresholds are determined with very great 
he two listeners (or ears) are equated and the “bin- 
hat so many trials are averaged 
are for all intents and purposes the parametric 
values of the two monaural thresholds. Now, if we assume that in the 
“binaural” tests the thresholds of the two listeners are independent and 
are distributed normally and with the same variance about their para- 
metric values, we can reduce the procedure of the monaural-binaural 
comparison to (a) determining two monaural thresholds, a first and a 
second, for each listener, (b) finding the difference, again for each 
listener, between the first and the second (i.e., first minus second), and 
(c) recording the algebraically larger of the two differences thus found. 
In as much as the firs ch listener was fixed at his para- 
metric mean, the differences were simply the deviations of the second 
thresholds (with signs reversed) from the parametric means of the dis- 
tributions of monaural thresholds. It is evident, therefore, that the 
bias in the monaural-bi bias introduced by 


listeners, one with only a ri 
two listeners have no means wh 
other, but both report to the sam 
of mon 
listeners separately (successively), 


common cente 
aural interaction has been found 


localization and on masking an 


of the bias (i.e. of the 


supplementat 
which the initia 
precision before t 
aural" tests made. We can suppose t 
that the initial means 


t threshold of ea 


naural comparison is the 
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selecting the larger of two quantities drawn at random from a normally 
distributed population, the population of momentary monaural thresh- 
olds. It is consequently of interest to examine the sampling distribution 
of the larger of two random normal variates. This sampling distribution 
is shown in Fig. 1 in relation to the normal parent distribution. 


0.45 
040 


0.30 


N(x), F(x) 


0.20 


[*] 1 2 3 4 
X IN STANDARD DEVIATION UNITS 


: 
Fic. 1. A NORMAL Di AND THE SAMPLING DISTRIBUTION 
(SOLID LINE) or THE HIGHER OF A PAIR OF VALUES DRAWN FROM IT INDEPENDENTLY 
AND AT RANDOM. 


is obtained by multiplying the density of the 
air will be x; by the probability that the other 
ensity of the probability that the first member will 
then the sampling distribution-function is 


Fi(x) 2f.) [os Í E eus] 


Note that Fix) is approximately symmetrical, 
gether to be indicated separately in the figure. Th 
deviation unit (standard deviation of the normal 


probability that either member of the p, 
will be less than Xi. Tf fx; is the d 
be x; and the second less than Xi, 


The mean and median lie too close to- 
e mode is approxima tely 0.06 standard- 
distribution) to the left of the mean. 


The derived distribution js leptokurtic and positively skewed. 
Hastings, Mosteller, Tukey, and Winsor (5) have determined its stand- 
ard deviation, which is 0.826 times the standard deviation of the normal 
parent distribution, and its mean, which is 0.564 standard deviation 


units higher than the mean of the Parent distribution. (The assumption 


= ^ 
——ÓÁ——— 
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iie ut ey n not to be critical; the corresponding values 

i a 57 parent distribution is rectangular.) For prac- 

tical purposes, it is immaterial whether we define our “binaural” thresh- 

old as a mean or a median. The mean of the derived distribution is 

icm ss as edge (0.56) of be line marked “Mean and 
vt gure, the median (0.55) by the left-hand edge. 

An additional component of bias enters the picture as we go a step 
further and relax the condition that the initial monaural thresholds be 
determined with great precision. Let us first isolate this new component 
by assuming that the initial thresholds are averages of n fallible deter- 
minations but that the final thresholds are precisely determined in a 
single momentary trial. Again the bias enters through the recording of 
the larger of the two differences (first threshold determinations minus 
second), but the situation is somewhat altered. This time it is the 
second of the determinations that is assumed to hit the parametric 
value, and the random variable is the mean of z initial determinations. 
The amount of bias is therefore 0.56 times the standard deviation of 
the distribution of means of n determinations. As m increases, the 
amount of bias decreases. We should not expect the bias to decrease as 
1/4/n, however, because the successive determinations would probably 


not be entirely independent of one another. 

The final step is now a simple one. We must relax both restrictions 
simultaneously and consider both the initial and the final thresholds as 
final estimates of the parametric value. Once again the bias results from 
the selection of the larger of two differences, and again it is 0.56 times 
the standard deviation of the distribution of differences. This time, 
however, the bias is due in part to supplementation per se and in part 
augmentation of the supplementation by error in equating 
Neither of the terms upon which the 


difference depends (i.e., neither the initial nor the final threshold) is 


fixed, and it is more difficult than it was in the restricted cases to ex- 
press the bias as a function of the standard deviation of the distribution 


of momentary thresholds. 
If we were willing to assume not only independence of the two 
f successive momentary thresholds of a 


monaural listeners but also o 
single listener, the bias could be estimated as 


oe n+1 
B = 0.564/ o? +— = 0.560.4/ —— 
n n 


viation of the distribution of monaural thresh- 
determinations taken in equating 


to spurious 
the sensitivities of the two ears. 


andard de 


c, being the st 
ber of monaural 


olds, and » the num 
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the two ears. However, successive momentary thresholds are m ria 
not independent. We must have more information about the law o x 
pendence, or we must work directly with the distribution of übt : ; 
Even though the distributions of differences for the two listeners may 
not be exactly equal in variability (the initial thresholds were deter- 
mined successively, the final thresholds simultaneously, so a longer re 
elapsed between the two determinations for one listener than for the 


other), they can be expected to be approximately so, and we shall follow 
the latter course. 


1 24 : . i% : a 
We are now in a position to consider the question: is the bias cor 


ceivably great enough to account for the differences between equated 
monaural thresholds and binaural thresholds that have been found in 
experiments on binaural "summation"? "These differences, for tonal stim- 
uli, have run from 2.5 to 3.6 decibels. Differences obtained with ran- 
dom noise (10) have been slightly smaller. Taking 3.0 decibels as a 
typical value, we see that the standard deviation of the distribution of 
differences would have to be 3.0/0.56=5.4 decibels if binaural supple- 


What we should like to know, ideally, 
distribution of differences between Pairs of monaural thresholds, the 
initial threshold of each pair being based on n determinations of the 
“momentary” threshold and the final threshold in each pair being based 


uld hardly expect to find exactly this 


lute threshold of hearing, but except 
at the determin 


is the standard deviation of a 


thresholds, we should 
tion of a distribution 
thresholds. Hughes (7 
approximately 0.5 decibe] 
ranging from 0.78 to 1.22 decibels. 


individual listeners; the mean of these 


equivalent of a standard deviation of 1.7 
decibels. And, finally, Shaw, 


: Swman and Hirsh (11) give standard 
errors of mean differences at eight different frequencies, each value cal- 


culated on the assumption that the 26 contributing differences were 
independent measures; the root 


standard deviations is 2.4 decibels, 
deviations at their face values, it is 
ably greater than the expected valy 


bs 


fa ay 


44 


— » 
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fact room for almost no doubt about the statistical significance of the 
difference between observed effect and expected bias. 

It should be recalled, however, that the expected value of the bias 
depends upon the standard deviation of the distribution of differences 
between (1) either individual or averaged values of the initial threshold 
and (2) individual (not averaged) values of the final threshold. We must, 
therefore, inquire into the natures of the final (binaural) thresholds 
which entered into the differences, the standard deviations of which we 
have listed in the preceding paragraph. The psychophysical procedure 
employed in the experiments on binaural summation has been, without 
exception, one of the variations of the method of production. The lis- 
tener has either manipulated the attenuator control himself or has sig- 
nalled the experimenter which way to adjust it. No time limit has been 
set for the determination of the values that have been recorded. It 
would be surprising, therefore, if the quantity recorded were not some 
kind of an average, a subjective or intuitive average, of momentary 
thresholds tentatively decided upon during the course of the determina- 
tion. If such were not the case, it would be difficult to justify the dif- 
ferences in precision among the experiments to which we have referred. 
It would be difficult to explain how such precision as that indicated by 
Hughes’ 0.5-decibel standard deviation of differences could be obtained 
when the momentary threshold is found by Lifshitz (8) to vary as much 
as 6 decibels in 5 seconds. 

If each threshold determined by the method of production is, as 
we believe it to be, more nearly an average than a momentary value, 
it appears quite probable that the standard deviation of the momentary 
values that are averaged might be as great as the 5 or 6 decibels re- 
quired to yield a bias of 3 decibels. We should have to conclude then 
that the experiments on binaural summation" at the absolute threshold 
did not prove the existence of a summative process in binaural hearing. 

As matters stand, it is important to have more information on 
several points before we can reject the hypothesis of binaural supple- 
mentation. Three questions are of particular relevance: 

e are the momentary sensitivities of the two ears corre- 


s of supplementation has assumed that the correlation is 
ds are independent. If the correlation should prove 
to be high and positive (bodily noises would tend to influence both thresholds 
in the same way at the same time; fluctuations of attention might also favor 
positive correlation), we could expect little supplementation, and it would be 


easier to demonstrate the existence of a true summative process. 
2. To what degree are successive, momentary, monaural thresholds cor- 


related? What we should like to know in this connection is the autocorrelation 


1. To what degre 


lated? The hypothesi 
zero, that the two threshol 
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function of the monaural threshold. Since the Fourier transform of the auto- 
correlation function is the power spectrum, we should—with that information— 
be able to speak precisely about “rates of fluctuation” of the threshold. 

3. How does the precision of a threshold determination depend upon the 
length of time spent in making it? We can regard the listener in a threshold 
test as a sort of statistical problem-solver, basing an estimate of central 
tendency (or perhaps of the significance of a difference) on data fed in, in the 
form of time series, through his sense organs. Adopting this point of view, we 
see that, if the variability of thresholds determined by the method of production 
is to have any meaning at all, something has to be said about the length of the 
time series on which the thresholds are based. A listener with a perfect memory 
could achieve infinite precision if he were given time enough. 


With information on the three questions 
decide whether or not experiments on binaural summation yield evi- 
dence of binaural summation. Without such information, it can indeed 
be said that two ears are better than one by about 3 decibels, but the 
theoretical implications of the experimental results are of considerably 
less force than they would be if they provided solid ground for rejecting 
the null hypothesis of supplementation. 


, it should be possible to 
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THE GENETICS OF SCHIZOPHRENIA 
A SPECIAL REVIEW 


NICHOLAS PASTORE 
Hunter College 


Nice cee in 1929 and reported by Kallmann! in his 
O0 e zophrenia, represents the most extensive and am- 
. bitious investigation undertaken in recent years in the attempt to 

demonstrate conclusively a measurable and necessary biological com- 

ponent in the etiology of schizophrenia. This work has been widely 

cited as having conclusively established a genetic predisposition for 

schizophrenia, a mental disease which is responsible for the largest 
pitalized cases for mentalillness. For these reasons, 
hodology is quite desirable. The immediate 
is the recent presentation by Garrison 
This particular presentation con- 
n and in the interpretation of 
Kallmann's data, which were quite puzzling to this writer, since Gar- 
rison's presentation was intended simply as a summary of Kallmann's 
major conclusions. It is hoped that the present review will at least give 
a more accurate picture of the Kallmann investigation. 


proportion of all hos 
a review of Kallmann's met 
stimulus for the present review 
of the conclusions of Kallman's text.? 
tains many errors, both in the quotatio 


KALLMANN's GENERAL PROCEDURE AND CONCLUSIONS 


on is to determine the extent 
oward schizophrenia. Kall- 
of 1,087 institutionalized 


The object of the Kallmann investigati 

and nature of a biological predisposition t 

mann's procedure involves an extended study 
schizophrenics (termed “probands”’), their ancestry and descendants. 

These 1,087 schizophrenics represent the total hospital population for 

E schizophrenia admitted to the Herzberg Hospital, Berlin, in the period 
1893 (the year in which the Herzberg Hospital was opened) through 

1902. The diagnosis of schizophrenia was made by Kallmann upon 

of 15,000 individuals contained in the 


of the case histories 
1 number of individuals suffering from 


examination 
ota 


archives of the hospital (the t dual: f 
all types of mental diseases admitted to the hospital in this ten year 


period). On the basis of the available data, Kallmann made a diagnosis 
even though his own diagnosis may have been different from the ori- 
ginal diagnosis. This was deemed necessary since “an earlier attempt 
to sift out the suitable cases of schizophrenia with the help of former 


z. The genetics of schisi ew York: J. J. Augustin, 1938. 


ophrenia. Ni 
MER. The genetics o J. abnorm. soc. Psychol., 


1 KALLMANN, FRAN. í ] 
2 GARRISON, JR., MORTI f schizophrenia. 


1947, 42, 122-124. 
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records on commitment and diagnosis had been abandoned as uscless : 
(p. 13). Kallmann established his diagnosis “on the basis ar Send 
logical and psychiatric findings, the onset and progress of the psyc eed 
and, when possible, on the later findings of autopsy (p. 13). Int De 
cases "where there was a suspicion of Schizophrenia we supplemente 
the records with Whatever material could be obtained from other hos- 
pitals and institutions, until we were able to make an sect ranted 
diagnosis” (p. 13). Furthermore, "all cases in which exogeneous factors 
might have played a decisive role in the origin or development of a 
psychosis that resembled Schizophrenia" were excluded (p. 14). For 
example, those cases Which seemed to "origin 
of tuberculosis" were ruled out. 

The ancestry and descendants of the 1,087 selected cases were 
examined. The total number of individuals, the probands and their 
ancestors and descendants, involved in this study was 13,851. Since 
not all of these individuals survived into the danger period for the de- 
velopment of Schizophrenia, the actual number involved is perhaps 
slightly more than one half of the 13,851. A basic part of Kallmann's 
Study involved the calculation of expectancy figures for developing 
schizophrenia in the children of the Probands, in the siblings of the 
Probands, and in the parents of the probands, These expectancy figures 
for particular family lines were then compared to the presumed expect- 
ancy figure of 0.85% (Luxenburger's estimate) for the general popula- 
tion. There was a further subdivision of expectancy figures according 
to knowledge of the type of mating that Was involved. Separate ex- 


for example, for those proband- 
g where both parents were schizo- 


ate as a clear consequence 


according to type of mating was in- 
mation with regard to the biological 
ith Kallmann's Mendelian outlook, 
hrenic should produce only schizo- 


typically schizophrenic, Another ma; 
the breakdown of schizophrenia in 
» and simple (S). Separate expectancy 
figures were calculated for offspring wh 
classified according to these clinical forms. 


b^" 
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A distinctive feature of Kallmann’s work is a detailed diagnosti 
count of the 108 children of probands who were du s 
phrenic (these proband-children are termed "secondary cases"). This 
makes it possible for other scientists to evaluate Kallmann's diagnoses 
of schizophrenic proband-children. It should be mentioned that Kall- 
mann divided these 108 schizophrenics into two categories: definitely 
schizophrenic (92) and doubtful (16). 

Kallmann's major conclusions are as follows: 


1. For the present we can prove schizophrenia only in a bare 10 per cent of 
the immediate ancestry of schizophrenic cases; but the total number of an- 
cestors obviously tainted with the hereditary predisposition to schizophrenia 
and belonging to the heredity-circle of schizophrenia comes, in the direct line, 
to at least one third of the parents, uncles and aunts (p. 42 f.). 


2. .. . we obtain in the generation of our proband-children the following 
total expectancy figures for the four clinical sub-groups. (p. 107). 
Schizophrenia Schizoidia 
H group: 20.7% 31.9% 
C group: 21.6% 30.2% 
P group: 10.4% 35.6% 
S group: 11.6% 31.7% 
[The term “schizoidia” is used by Kallmann to refer to “eccentric borderline 
cases" (p. 104).] 
3. ... the children of hebephrenics and catatonics have about double the 
paranoid and simple probands... 


frequency of schizophrenia of the children of 
(p. 108). [This difference leads Kallmann to label the hebephrenics and the 
catatonics as the “nuclear group” and the paranoid and simple schizophrenics 
as the “peripheral group" (p. 108). Kallmann is inclined to believe that a 
“biological law” is involved (p. 108).] 
4. ... the expectation of schizophrenia for an undifferentiated group of 
proband-children has been found to be 16.4 per cent. ... The total expectancy 
sisters and half- 


figure for siblings of schizophrenics is 11.5 per cent, for half- 
brothers, 7.6 per cent, for nephews and nieces, 3.9 per cent, and for grand- 


children, 4.3 per cent (p. 163). 


5. The probability of schizop 


parents amounts to 68.1 per cent ir 
ombination 


the offspring of the c c 
breeding proportion of one schizophrenic and one 
yields expectancy figures of 14.8 to 23.9 per cent (p. 164). : i 
6. The genetic connection between both disease groups (schizophrenia and 
tuberculosis) is proven by the consistent increase of the mortality from tubercu- 
losis in all sub-groups of our survey, and particularly by our finding that this 
death rate is much higher in the descendants of the nuclear group than in those 
of the peripheral group, and that it is always highest in the categories with the 
largest figures for the expectancy of schizophrenia. ... We... discover in all 
categories of consanguinity such a startling correspondence pe expectation 
of schizophrenia and mortality from tuberculosis, that the ratio of frequency be- 


hrenia in the children of two schizophrenic 
a our survey, while it is only 9.1 per cent for 


of two heterozygotic parents. The cross- 
non-schizophrenic parent 
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s ; > d 
tween the siblings and children of probands, and again between the ea 
children and children, is exactly the same for the expectancy of schizophrenia 
and for the mortality from tuberculosis. 


This statistical result is so unequivocal that it excludes all possibilities of 
coincidence, and can only be interpreted as a genuine gene-coupling of the 
tendency to schizophrenia and the heredito-constitutional susceptibility to m- 
berculous infection. It also indicates an identical pattern of hereditary trans- 
mission for these two predispositions, and confirms the assumption that both 
schizophrenia and tuberculosis represent recessive traits (p. 271). 

Although these conclusions are based exclusively on German ma- 
terial, Kallmann thinks that they are also applicable to American popu- 
lation groups (p. xv). 

It should not be understood th 


at the foregoing quoted conclusions 
represent all the major conclusion 


s or that other conclusions are of no 
interest. The quoted conclusions were selected because they reflect the 
initial questions underlying the investigation and because they are 
relevant to the criticisms which will be considered in the next section. 
Parenthetically, it should be pointed out that Kallmann is deeply 
interested in eugenic questions and the eugenic implications of his 
conclusions receive major emphasis in his writing. 


KALLMANN’s METHODOLOGY 


erzberg Hospital during the interval 1893- 
an twenty-five years later. 
inges on the correctness of 
ned, in the way of offering 
e, that Kallmann Should have had other psy- 
cal records (or some sample of records) to de- 
of diagnoses. Possibly with this in mind, Kall- 


chiatrists check the clini 
termine the consistency 
mann wrote: 
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clinical records, no atime H- ie - i = Ages jie 
ET i 5 gnostic error is possible. This defect 

could have been remedied, in part, if Kallmann had re-rated a sample 
of the clinical records after a period of time and in this way determined 
the reliability of his ratings. A possibly more serious error of indeter- 
minate magnitude involves the reliability of the clinical records them- 
selves. Psychiatrists of the last century must have been guided by the 
then contemporary diagnostic conceptions and psychiatric theories in 
observing and recording the behavior of an individual. If this was the 
case, then the clinical records would unalterably reflect that fact. 
Kallmann himself seems to have noted this shortcoming when he ob- 
served that his “survey” was “limited to case histories that were made 
decades ago and described according to obsolete medical methods and 
psychiatric conceptions" (p. 22). 
Although an unknown margin of error might have been involved in 
the classification of the 1,087 probands as schizophrenics (with a pos- 
sibly additional error by virtue of segregating each of the 1,087 pro- 
bands into one of the four clinical categories), at least one fact is clear— 
the probands were institutionalized for mental illness. In classifying 
the ancestors and descendants of these probands, however, the insti- 
tutional criterion was completely lacking in a large number of cases. 
Consequently, many individuals must have been diagnosed on the 
basis of “anecdotal” materials. This would be especially true for the 
parents and grandparents of the probands— individuals who probably 


matured in the first half of the nineteenth century and late eighteenth 
ype of diagnosis for these individ- 


century. Establishing a particular ty 
be quite risky. From the point of 


uals, almost a century later, seems to 
view of genetics, anecdotal information concerning parents and grand- 
parents of probands is worthless, notwithstanding the diligence of the 


investigator. 
Precise kn 
bands would have 
it possible to relate 
offspring. Furthermore, 


spouses was in doubt, at n 1 
have been obtained, as was done in the case of the probands. Kallmann, 


however, deviated from his previous definitions and classified the pro- 
band-spouses in rather unclear terms. For example, one subdivision of 
proband-spouses places into one category those spouses who are both 
“schizophrenic or suspicious of schizophrenia (p. 154). Another single 
category includes those spouses who are “epileptic, feebleminded, or 


diagnostic categories. 


owledge concerning tatus of the spouses of the pro- 
been of considerable interest, for it would have made 
the characteristics of the spouses to those of their 

f some of the proband- 


even if the status 0 
least information about their ancestry could 


the s 
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suffering from an organic nervous disease" (p. 154). It is not known 
vhether institutionalization was an importa 
cation of proband-spouses as sch 
orobands. Thus, it is unknow 
:o the same set of diagnostic 
spouse. 

The diagnosis of schizo 
‘secondary cases," also off. 
ncludes 108 secondar 
schizophrenic cases” and 16 as “‘doubtf 


nt criterion for the classifi- 
izophrenics, as was the case for the 
n whether the term schizophrenia refers 
criteria for both proband and proband- 


ully schizophrenic cases." The 
in Kallmann's work is unknown. Will 
o definite schizophrenics? Should the 
be given equal weight in various calcu- 

st question could not, of course, be con- 
lusive. With regard to the second question, many of Kallmann's 
najor conclusions presuppose that the "doubtfuls" and the ''definites" 
ire, in fact, of equal value, Thus, some 
o the expectation of Schizophrenia i 


n proband-children actually in- 
lude the “doubtfuls.” 


For instance, K 
i 4% subsumes 
(see Table 38, p. 
rough two sets of 
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It would seem at least that different diagnostic criteria were involved 
in classifying the 6 non-institutionalized secondary cases, who were 
dead before the investigation began. For these 6 ndadi, Kalimann 
must have utilized the judgments and observations of psychiatrically 
untrained individuals as a basis for his own classifications. 

. The methodological purist would have some justification in claiming. 
with regard to the consistency of diagnostic procedures, that the ons 
institutionalized cases, and certainly the "doubtfuls," should be ex- 
cluded from the schizophrenic category. This would reduce Kallmann's 
108 secondary cases to 73. A strong case can probably be made for the 
inclusion of the 35 non-institutionalized individuals but, then, Kall- 
mann's conclusions and inferences should not be set forth with the de- 


at Kallmann attaches to them. 
hich deals, not with diagnosis as such, 
“proband” and "secondary 


he 108 secondary cases shows 


gree of certainty th 
Another question remains w 
but rather with definitions of the terms 


case." An examination of the records of t 
that 8 of these cases had been institutionalized in the Herzberg Hos- 


pital in the interval 1893-1902. One of Kallmann's criteria for selecting 
probands involved admission into the Herzberg Hospital in the 1893- 
1902 interval. Consequently, there are 8 secondary cases which, accord- 
ing to the criteria, should be classified also as probands. There are three 
possibilities: (a) Kallmann did not adhere rigidly to his criteria (a thing 
which he is entit d accordingly did not classify 
the 8 secondary cases as probands. (b) Kallmann classified the 8 second- 
ary cases as probands and excluded their 8 schizophrenic parents from 


the proband material. If this was done, then Kallmann's figure of 92 
renic cases is reduced by 8. (c) Kallmann 


definite secondary schizoph 

classified the 8 individuals both as secondary cases and as probands. 

That is, these 8 individuals were given a double statistical value. If 

this procedure was followed, then an inflation of incidence figures oc- 

curred in two directions. When determining the proportion of proband- 

children who were schizophrenic, the 8 secondary cases were not classi- 
When determining the 


fied as probands but simply as secondary cases. 
proportion of parents of probands who were schizophrenic, the 8 second- 
ary cases were classified as probands, and then 8 of the probands were 
shifted into the category of parents. 

This last question was considered in detail because Kallmann makes 
no mention of the point. Doubt is raised as to the exact procedure fol- 
lowed by Kallmann (he should have followed procedure [a] described 
above) because of his arbitrary statistical treatment of the following 
analogous issue. Three secondary cases had parents both of whom were 


led to do in this case) an. 
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probands. Kallmann gave these 3 secondary cases a "'double statistical 
value," that is, they were "counted twice," in calculating the expecta- 
tion of schizophrenia in proband-children (p. 165). This dubious pro- 
cedure, of course, increases the size of the expectation and incidence 
figures. (In our discussions up to this point, we have been citing the 
figure of 108 secondary cases. Actually, Kallmann speaks of 111 second- 


ary cases—the 108 secondary cases plus the 3 cases of double statistical 
value.) 


Sampling Difficulties 


For a problem such as the one at hand, it is very difficult, if not 
impossible, to obtain a random sample. One can receive some assurance 
that a sample is not unduly affected by selective factors by comparing 
some characteristics (characteristics which are independent of schizo- 
phrenia) of the sample with those of the general population. The char- 
acteristics chosen for comparison are themselves selected, knowingly or 
otherwise, from a larger population of possible characteristics. There- 
fore, the coincidence of some selected sample characteristic with the 
population value should not lead one to conclude that the sample is 
random or that the sample is not affected by selective factors. On this 
point, Kallmann claims that his sample was “random” (p. 7) when, in 
fact, it was not. By the word random he probably meant that the 
values of certain sample characteristics (for example, male-female 
ratio) coincided more or less with those of the population at large. In 
the interests of clarity, Kallmann should not use an already precisely 
defined term, that is, "random," in a different sense. Moreover, it should 


be mentioned that for the particular characteristics discussed by Kall- 
mann, the values prevailing in the 


general schizophrenic population are 
actually unknown. 


$ In choosing the sample characteristics, Kallmann 
should have considered other properties relevant to his problem. For 


instance, the question of whether the distribution of his schizophrenic 


individuals according to the form of schizophrenia was the same as the 
one which prevailed for the general 


à : 5 population of schizophrenics should 
have received some discussion. 


The foregoing general methodological considerations should have 
served, at least, as a note of caution in generalizing his conclusions from 
the available data. Kallmann, however, asserts his major conclusions 
as unassailable findings. The presence of possible selective factors gov- 
erning the inclusion of individuals in his sample, should have served as 


a further note of caution. These possible selective factors will receive 
some brief discussion. 
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cannot neglect the possibility that, generally, only the 
j istitutionalized (and, in particular, at the Herzber 
Hospital), and that such cases were more likely to have an inherited ae 

. There were marked age differences among the probands. The range of the 
birth-dates extended from 1820 to 1887 (p. 24). These differences were reflected 
in the wide age differences which prevailed among the probands classified ac- 
cording to the form of schizophrenia (cf. table 3, p- 24). 

A special selective factor, which affects the inclusion of all probands, con- 
cerns the relative proportion of remissions among the various schizophrenic 
disease-forms. For example, those aged schizophrenics who were institutionalized 
for decades probably had the smallest proportion of remissions and this, in 
turn, might be related to the relative incidence of schizophrenia in their off- 


spring. 

The fact that Kallmann 
ayed a decisive 
that resembled schizophrenia” (p. 14) 


factor. For example, "wherever we foun 
of schizophrenia dependent on the question of whether the psychotic symptoms 


occurred before or after the active tuberculosis had broken out. We tried, at 
least, to establish that schizophrenia did not originate as à clear consequence o 
the tuberculosis” (p. 14). Thus, Kallmann excludes those schizophrenias which, 
according to his statement, are environmental in origin. If such schizophrenics 
had been included in his calculations, the effect probably would have been to 
decrease the expectation of schizophrenia in the offspring. " 

After having chosen 1,087 probands, Kallmann “dropped 40 cases because 
“some new doubts arose. -- as to the unassailable certainty of the original 
diagnosis of schizophrenia" (p. 15). Kallmann provides no definite information 
on the nature of these doubts. Furthermore, although these 40 cases "produced 
a series of undeniable secondary cases of schizophrenia" (p. 15), it would be of 
interest to compare the incidence of schizophrenia in the offspring of these 40 
probands with that of the offspring of the other probands. Kallmann sometimes 
includes the 40 cases in his calculations and summaries without justifying their 


inclusion. 

Proband-children. It should be mentione 
spring of a selected oe pre pete 
fertile. Inasmuch as probands of the ori : 
children? (there were 510 married probands) it would be of interest to know 
whether married and unmarried probands differed with respect to certaim rele- 
vant characteristics. For instance, is the ancestry of the unmarried probands 
more “tainted” than the ancestry of the married probands, or, is the distribution 
of “tainted” siblings proportionately the same for both groups ; 

The married probands produced 2,000 children, of whom approximately 
1,000 died before having reached the twentieth year. Ithough it is not possible 
to get any information on such deceased individuals, the selection of data which 
is thereby introduced should have served as another reason for caution 1n estab- 


lishing rigorous conclusions. 
The sample consisting of 1 


Proband material. We 
more severe cases were it 


in which exogenous factors 
development of a psychosis 
seems to be a contributory selective 
d tuberculosis we made the diagnosis 


“ruled out all cases 


might have pl role in the origin or 
I 


d that such children were the off- 
those probands who were 


1,047 probands prod uced 


11 secondary cases is too small for the intensive 


3 See table 12, p. 52 for these figures. 
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T 3 i is dis- 

statistical analysis which is rM (a specific example of this point i 

i next section under $ . oint 
“woe proband- parents, proband-grand parents. Wi risen to pa xD 
out that the various expectancy figures involving these a EA: 
incomplete data; the data were probably least complete: x p di 
parents. The effect of such selection of individuals (individua 2 v edens 
because of availability of information concerning them) upon the v 
pectancy figures is quite unknown. 


Tabular and Statistical Presentation of Data 


" . C nn 
For a ready and complete understanding of his data, the Kallma 


-atisti en- 
monograph is generally inadequate: the tabular and statistical pres 
tation of the data involve some Shortcomings. 


a zi -ealing 
ts are sometimes given in terms of percentages without revea 


on which the percentages are based: in some con 
numbers are too few to warrant any generalizations (cf. p. 112, table ki: 
Table 38 deals with the "differentiated expectation of schizophrenia and son g 
s table, the expectancy figures for erii 
phrenia fall into 24 different categories—that is, there are 24 cells among - on 
the schizophrenic proband-children are distributed for purposes of calcu a i 
e 111 schizophrenic children a 
, each of the 24 expectancy figures is based on 
olving children of simple-schizo- 
meaningful since these probands produced only 
4 Schizophrenic children. 


verlapping categories are Sometimes used. For example, in table 10, p- 
40, which deals with the “incidence of schizophrenia” in the ancestry of legiti- 
mate probands, the following ty erlap: “schizophrenia in one 


parent,” and “schizophrenia 1n one parent and other abnormalities in the second 
” 
Parent. 


these points). One definite schizo i 

cipient schizophrenic’! (p. 191, case #89). 
allmann consistently chooses the hi 

figures, where a choice 


differential choice is t 
Schizophrenics, As stated before, 
of computations, one including, 


r A T individ- 
* The 35 non-institutionalized individuals includes the 16 "doubtfuls." One indi 


5 . 180, 
ual, due toa typographical error, obviously hasa wrong birth-date in the record (p. , 
, 

case #56). 
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“doubtfuls” are excluded, and 11.5% if 5 i 

u l ^ 1 5% if the ‘‘doubtfuls” are included. s 
senting his overall conclusions, the 11.5% figure is cited as the wise tie 
schizophrenia in proband-siblings (see p. 287 supra for a quotation of Kall- 


mann's conclusion on this point). A similar situation prevails in the calculations 
n (see pp. 106 ff.). 


of the expectation figures for proband-childre 
m e oon 2 e responsible for Kallmann's differential 
esi Pate at there exists no generally accepted objective method 
[ g expectancy figures. In chapter 4, Kallmann calculates the ex- 
pectation of schizophrenia in proband-children in three ways—two of the meth- 
ods were developed by Weinberg and the other by Ilse. The two Weinberg 
methods yield expectations of 14.4% and 16.4% and the Ilse method yields an 
expectation of 12.8% (pp. 139-141). The Ilse method, according to Kallmann, 
has certain objectionable features which make it unsuitable for the purposes at 
hand. Of the two Weinberg methods, the method which Kallmann recognizes 
as “doubtless the more accurate one” (p. 139) yields the expectation figure of 
14.4%. The other Weinberg method (the abridged method) yields a figure of 
16.4% which is the figure cited by Kallmann in his over-all conclusions, and 
throughout the book. Kallmann’s reason for choosing the 16.4% figure is that 
the method for its calculation is less complex than the other Weinberg method 
and the fact that "the final results in both methods correspond so closely that 
they may be taken as sufficient evidence of the statistical reliability of all our 
other expectancy figures calculated by the abridged method" (p. 139). In the 
interest of accuracy, Kallmann's procedure seems quite dubious—if the method 
which yields a 14.4% figure is the more accurate one then that is the method 
which should be used regardless of its complexity. Moreover, the Closeness of 
two results (the 14.4% and 16.4% figures) for calculating a single expectancy 
figure is nota sufficient guarantee that such closeness will obtain in other cases. 
It was the disparity between the two Weinberg methods which led Kallmann to 
use the Ilse method which, in turn, was rejected (pp. 139 ff.). 

5. At least one basic table cannot be understood by the writer as it stands. 
Table 58, p. 157, deals with the “expectation of schizophrenia and schizoidia in 
the offspring of two schizophrenics.” The expectation that such offspring will 
develop schizophrenia is 68.195 and the expectation that they will develop 
schizoidia is 45.796. It would seem, then, that the probability that the offspring 


of parents both of whom are schizophrenic will be either schizophrenic or 
It is possible, of course, that Kallmann has overlap- 


schizoidic exceeds 100%. 1 verla 
ping categories in this particular case. But Kallmann does not discuss this point 
and there seems to be little justification for overlapping categories 1n this case. - 
If schizophrenic offspring were initially schizoidic then they should be classified 
only under the schizophrenic category. 

6. Occasionally the numbers which appear within the body of a table are not 
consistent with other statements (a conspicuous example of this kind is dis- 
cussed below). On page 12, in discussing the selection of individuals as probands, 
Kallmann states as à criterion of selection: “They must have been schizo- 

have become 


phrenics beyond the possibility of doubt, and the disease must 
[forty." Yetin table 53, page 140, which gives the 


manifest before the age © i 53 s 
“age distribution of probands at disease-onset,”’ it is seen that disease-onset for 
27 probands was at the age of 40 years or later. Furthermore, 3 schizophrenic 
proband-children had a disease onset in the fortieth year or beyond. 
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7. Legends at head of tables are sometimes inaccurate (sec the following 


discussion). 
INCIDENCE OF SCHIZOPHRENIA IN THE ANCESTRY OF LEGITIMATE PROBANDS 
(Adapted from Kallman's Table 10, p. 40.) 
: Definite Schizophrenia in a Parent, Grandparent, Aunt 
Yee T or Uncle (percentages) 
egitimate 
Probands a) (2) (3) (4) (5) 
Schizo- 
pos of phrenia in . 
inn Seige: | soies Pe Parent) cm0 || Selso 
Pree: | goo P " Send j 50 | and other phrenia |phrenia in 
lute pid iii = abnor- ina a sibling | Total 
No. ig oy uF i malities grand- of the 
paren garni in the parent parents 
second 
parent 
H 414 84 3.9 0.2 2.2 1.0 2.9 
P 225 89 3.1 0.9 1.8 2:7 PA 1.1 
c 136 90.7 2.2 — 0.7 1.5 2.9 A 
S 141 93.4 1.4 E 1.4 1.4 1.4 5.7 
Total No. | 916 87.5| 3.1 0.3 1.7 1.5 2.6 9.3 


In interpreting Table 10, Kallmann writes, 
bands, roughly one third have a schizophren 
grandparent, aunt or uncle” (p. 38). 
age, not even a tenth of our 


"Of the legitimate pro- 


are intended to indicate the percent 
her a schizophrenic or a schizoid an- 
the base of the various percentage figures is not 
s but rather the number of legitimate probands. 
Although Kallmann is probably aware of this fact, yet he slips into the 
ious percentage figures with their incorrect base. 
that the legend of Table 10 does not correspond 
€. The legend refers to the “incidence of schizo- 
y of legitimate probands,” whereas the last row 
€ percentage of probands who have schizophrenic 


It should be noted 
to the body of the tabl 
phrenia in the ancestr 
of numbers refers to th 
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nte. Kallmann falls into the same error when he concludes that 
‘no more than 10.4 per cent of the proband-parents could be ascertained 
to have been definitely schizophrenic” (p. 164). This error just doubles 
the number of schizophrenic parents of the probands. Of the 916 legiti- 
mate probands, 3. 1% of them had one schizophrenic parent and 0.3% of 
them had two schizophrenic parents. The total number of schizophrenic 
parents is therefore, at most, 31 (3.4% of 916). On the other hand, if 
the number of parents (916 X2) is incorrectly used as the base for the 
3.4% figure, then the resulting figure would be 62. Similarly, this error 
would quadruple the number of schizophrenic grandparents of the 


probands. 
n has overlapping cate- 


ists in this table. Kallman 
(Compare the 


Another error ex 
e effect of raising the percentages. 


gories which have th 
legends in columns 1 and 2 with the one in column 3.) 

The 9.3% figure has a valid interpretation (ignoring the question of 
overlapping categories), viz., 9.3% of the probands, roughly 90 pro- 
bands, will turn up one schizophrenic ancestor, if his ancestry is exam- 
ined in the manner prescribed by Kallmann. But, then, if the ancestry 
of the probands is examined sufficiently carefully and extensively, all 
probands will almost certainly lead toa schizophrenic ancestor. 

Consider now Kallmann’s conclusion that 10.4905 of the proband- 

(this conclusion was not in- 


parents were definitely schizophrenic 
cluded in the list cited on pp. 287-288.) For one thing, the term 


parent is used to cover ancestors in general. Also, “other abnormalities" 
(see column 3) are classified under schizophrenia. In any case, the 
correct percentage to convey the idea which Kallmann intends is 1.7% 
(31 schizophrenic parents distributed among 1,832 parents). This per- 
centage is considerably reduced if all the investigated ancestors of the 
probands are listed under the term of ‘parents,’ as is done by Kallmann. 
This section refers to Tables 34-37, PP- 106-110, which deal with the 
“probability of schizophrenia and schizoidia" in the children of the 
probands. Separate probability figures are also given for proband- 
children depending on the form of schizophrenia which characterizes 
the probands. According to these tables, the probands produced a total 
of 111 children who were definitely or doubtfully schizophrenic. These 
411 children are distributed according to the form of schizophrenia of 
the proband. In chapter 5 of the book, Kallmann presents an account 
of each proband-child, ma ive at another estimate 
ure of 10.4% was obtained. It probably represents, how- 

the one based on legitimate probands, the 


king it possible to arr 


5 [t is not stated how the fig 
average of two percentages, 


ever, à weighted 
ate probands. 


other on illegitim 
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of the distribution of the 111 children. The two distributions, the one 
based on the data presented in Tables 34-37 and the other based on 
chapter 5, are compared in the following table. 


Distribution of the Number of Schizophrenic 
Children According to Form of Schizophrenia 

Form of Schizophrenia in Proband 
in the Proband 


(Tables 34-37) (Chapter 5) 
Hebephrenia 55 55 
Catatonia 23 23 
Paranoia 22 29 
Simple 11 4 
Total No. 111 111 


The discrepancies between the two distributions constitute a serious 
weakness in Kallmann’s conclusions since many percentages are based 
upon figures given in the column entitled “Tables 34-37” in the above 
table. There is reason to believe that the distribution obtained from 
chapter 5 is the accurate one, for each individual schizophrenic proband- 
child is described and classified according to the type of schizophrenia 


in the proband. The discrepancy is especially significant in so far as the 
S group is concerned. Kallmann’s expectancy 


children whose parents fall in this group is 11 
centage is based on 11 schizophrenic children 
should be 4. With a correction for this er ' 
about 4.0%, Parenthetically, 
produced no children of like me 
cording to chapter 5). Table 
children were produced by 


Tis Paragraph deals with Tables 44-47 (pp. 125-128) which give 
the "probability of Schizophrenia and schizoidia" in the siblings of me 
probands. The number of siblings involved in the various calculations is 
close to 2,000. ` The tables provide information as to the number o! 
siblings who originate in five different types of matings. The two mat- 
ings of relevance are those which involve "one parent sci S the 
other normal or psychopathic” and “one parent schizophrenic, the 
other schizoid or suspicious of Schizophrenia" (p. 125). These two 
types of matings produced 366 of the 2,000 albifisps. On the basis of 
these data, it is possible to form a minimum estimate of the total num- 

who were schizophrenic. The average 


ber of parents of the probands 
number of children per family Produced by the parents is 4.5, the divi- 


figure for the proband- 
6% (p. 107). This per- 
whereas the actual figure 
ror, the 11.6% is reduced to 
female simple-schizophrenic probands 
ntal disease or any mental disease (ac- 
37, p. 110, indicates 8 simple schizophrenic 
such probands, 


THE GENETICS OF SCHIZOPHRENIA 299 


sion of 3.5 into 366 yields about 104 family units wherein each famil 
unit contains a schizophrenic parent. (In the preceding MH en 3.5 
was used as the divisor since one of the offspring in a family EEE 
proband.) This estimate of 104 schizophrenic parents contradicts the 
value of 31 schizophrenic parents which was obtained from Table 10 
p. 40. This contradiction calls for a clarification. f 
Mayor CONCLUSIONS 


Kallmann’s major conclusions were presented in an earlier section 
of this review. Each of these will now be considered in light of the fore- 
of Kallmann’s methodology. The numbers in the 
on correspond to the numbering of the conclusions on 


EVALUATION OF KALLMANN’S 


going discussion 
following discussi 
page 287. 


1. “For the present we can prove schizophrenia only in a bare 10 per cent of the 


immediate ancestry of schisophrenic cases." On the basis of his own data, this 
conclusion is incorrect. The ten per cent figure does not tell us anything about 
the proportion of proband-ancestors who were schizophrenic. For this per- 
centage is not based on the total number of ancestors, but rather, on the number 
of probands who have a schizophrenic parent or ancestor in the family. The 
actual incidence figure for schizophrenia in proband-ancestors is much less than 
2%. Similar comments apply to the second part of conclusion (1). In any case, 
it is questionable whether the data pertaining to proband-ancestors is suffi- 
ciently reliable to warrant the calculation of incidence figures. The diagnosis of 
proband-parents and proband-grandparents, of individuals who lived in the last 
half of the eighteenth century and the first half of the nineteenth century, is, 
at best, tenuous. 

2. The total expectancy figure for schizophrenia in the children of the simple 
ds is 11.6 per cent. An actual count of the number o 
d children falling into this particular parental sub-group 
ure of approximately 4 per cent. 


to two distinct schizophrenic groups nu- 
ion (2) in mind, this 


schizophrenic proban 
schizophrenic proban 


yields an incidence fig 
bands can be divided in 


3. The pro ? 
clear” and “peripheral.” With the correction for conclusion t. 
distinction between “nuclear” and “peripheral” schizophrenia becomes con- 
fuse : : 
chizophrenia for au undifferentiated group of proband- 
". The schizophrenic category, here, 


16.4 percentage is an inflated figure. 


found to be 16.4 per cent. 
hich admittedly is notas 


children has been 
d ls." Therefore the 


includes the 
based on à method w 


Furthermore, the 16.4% figure is c : à 
. accurate as another method which yields a figure of 14.4%. The diagnostic 
< Iq the various groups mentioned probably do not re- 


criteria for schi 
late to the same set : R 
5. “The probability of schizophrenia 


amounts to 68.1 per cent in our survey, W 
UO heterozygottc parents. 


of the combination of t 
onclusions are either bri 


in the children of two schizophrenic parents 


hile it is only 9-1 per cent for the offspring 
" The 68.1% figure is calculated 


5 For continuity of presentation, these C efly quoted or 


restated in this discussion. 
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Írom data which yield an expectancy figure significantly larger than 10076 x^ 
offspring of schizophrenic parentage will be either schizophrenic or schizoid. 
is possible that overlapping categories, for which there is little justification, are 
involved. The data on which the 9.1% figure is based must refer to proband- 
parents and/or proband-siblings and their spouses—individuals for whom diag- 
nostic errors, and errors due to partial accumulation of data, are the largest. 
6. "The ralio of frequency between the siblings aud children of probands, and 
again between the grandchildren and children, is exactly the sa me for the expectancy 
of schizophrenia and for the mortality from tuberculosis.” This result indicates a 
“genuine gene-coupling of the tendency to schizophrenia and the heredito-constitu- 
tional susceptibility to tuberculous infection.” The constancy of the two sets of 
ratios involves several questionable procedures. First of all, Kallmann's cate- 
gories for schizophrenia in these calcul 
The omission of such cases would completely upset Kallm 


ann's conclusions (see 
more accurate figure of 
14.4% for the 16.4% figure in calculating the ratios would also lead to conclu- 
These two facts alone indicate 
' Assuming the correctness of Kallmann's 
"genuine gene-coupling' 
asoning is probably in- 
act that the calculations 
from tuberculosis do not 


valid, whatever it happens to be, is indicated by the f. 
for the expectation of Schizophrenia and mortality 
involve the same individuals. 


} o other psychosis, Furthermore, the schizophrenic 
offspring of probands develop, on the a 


470) develop the same form of schizophrenia 
es the parent, There are wide differences in 
ding to the disease form. Thus, hebephrenic 
d a total of 32 definitely schizophrenic off- 
) had the Corresponding disease-form. The 


esent writer the number of cases is too 

f mong these various groups, this discussion 
n irequently assumes the reliability of differences based on 
the same, or even smaller, number of cases, 


3 $ : The writer simply wishes to point out that 
certain relevant breakdowns are Possible, from the standpoint of Kallmann's methodol- 
ogy. 


fully reliable differences a; 


re 


x 
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ology, that there is a specific hereditary predisposition for each disease 
form, and that there are sex differences for such specific hereditary pre- 


dispositions. 
Kallmann's 
weight of genetic 


phrenia is the conception that 
factor does not preclude an environmental component. 


methodological procedure, however, implies that environmental differ- 
ences are not related to the etiology of schizophrenia. Kallmann inter- 
prets various expectancy figures? as supporting a predictable Mendelian 
pattern. Since environment is certainly a variable in Kallmann's data, 
arily correlated with genetic predisposition, a Mende- 
in this case, implies that environmental differences 
on the development of schizophrenia. 

al variable uncontrolled in Kallmann's in- 
mily structure. This variable can be 
figures which Kallmann ob- 


general scientific attitude with regard to the relative 
and environmental factors in the etiology of schizo- 
a demonstrated necessary hereditary 
Kallmann's 


and is not necess 
lian methodology 
are without effect 

A specific environment 
vestigation is intactness of fa 
related to certain differences in expectancy 
tained. In this connection, it should be recalled that Kallmann dis- 
o types of schizophrenias— "nuclear" group which 
includes hebephrenics and catatonics and the “peripheral” group which 
includes the paranoids and simple-schizophrenics. The basis for this 
distinction was the finding that the expectancy figure for the offspring 


of the "nuclear" probands is double the expectancy figure for the off- 
spring of the "peripheral" Kallmann interprets the 


probands (p. 108). 
difference between the two groups callines. This difference, 


along biologi 
however, can be interpreted in terms of differing 
of family structure. The “nuclear” group is characteri 
period of hospitalization and an earlier disea 


se onset than the “periph- 
eral" group (sce Table 4, P- 26). 


Both of these factors would con- 
tribute to a greater degree of impairment in the family structure of the 
"nuclear" probands an g. In 


d their offspring a different connection, 
Kallmann recognizes the fact t 


hat the "family structure" of paranoid 
probands is "generally not severely affected and rarely breaks down 
altogether when the psychosis in one of the parents occurs compara- 
tively late" (p. 10). It was this statement, in fact, which suggested to 
the present writer the possibility that intactness of family structure isa 
's investigation. It 


relevant and uncontrolled variable in Kallmann 
should be mentioned, furthermore, that intactness of family structure 


ctancy figures for offspring a particu- 
as related to the presu 


tinguishes between tw 


degrees of intactness 
zed by a longer 


8 Over-all expectancy figures, eXP® of parents of c 
i med heterozygosity 


lar form of schizophrenia, expectancy 
or homozygosity of parents. 


figures 
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parallels the different types of biological matings. Thus, the family 
structures in which both parents are schizophrenics will show the largest 
amount of impairment. 

SUMMARY 


In this review, the writer has dealt only with those points which to 
him seemed to be of major importance. Thus, discussion of the relation- 
ship of Kallmann’s eugenic suggestions to his obtained results was 
omitted despite the fact that eugenic considerations were foremost in 
Kallmann’s thinking. After consideration of the various questions 
which can be raised with regard to Kallmann’s methodology—his diag- 
nostic procedures, statistical treatment of data, sampling procedure, 
uncontrolled variables—the present writer is of the opinion that the 
genetics of schizophrenia is still an open question. It should be empha- 
sized that the present writer in no way wishes to minimize the contribu- 
tion of hereditary factors in the etiology of schizophrenia, however 
unknown the specific nature of these hereditary factors may be. The 
Kallmann investigation, however, supplies no reliable information for 
assessing the genetic basis of schizophrenia. 
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NOTES ON "THE VALIDITY OF PERSONALITY 
INVENTORIES IN MILITARY PRACTICE,” 
BY ELLIS AND CONRAD* 


DONCASTER G. HUMM AND KATHRYN A. HUMM 
Los Angeles, California 


The importance of the problem of measuring personality is evi- 
denced both by the attempts which are being made to solve it and by 
the published discussions of these attempts. 

The currently available standardized measures of personality have 
disappointed a number of psychologists who have attempted to use 
them, and the attempts of these psychologists and the discussions of 
their findings in the literature have been equally disappointing to many 
working in this field. 

The review by Ellis and Conrad of a number of reports of the mili- 
ality questionnaires seems to the writers to perform a 
a number of faults in making assumptions, 
lying mathematics; but this review 
m to us to indicate failures to ap- 
ionnaires—failures as serious 


tary use of person 
service in calling attention to 
in controlling criteria, and in app 
includes some statements which see 
preciate the nature of personality quest 


as those of the military users. 
This is clearly shown by their conclusions, which we feel need dis- 


cussion. 

Their first conclusion is that temperament measures will be effective 
only if they are “specially designed for the group to whom they are 
applied." The standardization of such an instrument, involving as it 
does a case study of every subject to be used, the tabulation of the 
answers of all standardization subjects classified by the temperamental 
groups into which they fall, the analysis of the answers to each question 
to determine which temperamental groups it distinguishes, the further 
results to establish the effect of response-bias and to 
compute compensations for it, the validation of the completed question- 
naire by trying it on a new group of subjects again classified by case 
study according to temperamental groups, and finally the administra- 

representative population to determine 


tion of the questionnaire to a é 
norms—all this is a laborious and expensive process. 
uch labor and expense for each group to be tested 


The repetition of s per j 
would be justifiable, if feasible at all, if it would actually improve the 
results; but, on the contrary, the more one instrument can be used in 
RAD, H. S. The validity of personality inventories in military 


8, 45, 385-426. 


analysis of the 
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ifferent situations, the more data can be compared and the more can 
e learned about the likenesses and differences among people. 

Our experience leads us to the firm conviction that there are person- 
lity characteristics which can be identified by the use of a well-stand- 
rdized questionnaire: that they are to be found in various combina- 
ions throughout the population; that their manifestations may be de- 
rable or undesirable according to the combinations in which they occur 
nd according to the situations to which their possessors are exposed; 
nd that the general application of an instrument which measures them 


? a wide variety of groups is an important way of advancing our knowl- 
dge of personality. 


—some subjects 
s they do not have. 


factor, sometimes causing 
sometimes apparently arous- 
ny characteristics they have 
nstances. However, response- 
tions for its effect can be devised. 
ing attention to its importance, but 

i sposed of by urging truthful- 
ess. 


1g defensiveness and causing them to de 
hich might not be damaging in the circun 


that are more sophisticated," does not 
sree with our experience. A number of psychologically sophisticated 


dividuals have taken the Humm-Wadsworth Temperament Scale 
rcumstances where they had considerable stake in th 
icants for positions) and some of these have shown u 
es. Enough of them did poorly to demonstrate that 
re cannot be produced at will. Again, enough of the 
ologically sophisticated Subjects have been verified 1 
cause us to place more reliance on 
ts than we did originally. 


in 
€ results (as ap- 
nfavorable Pro- 
a favorable pic- 
Profiles of psy- 


—€4— um 
"Tq 


-© personnel administrator. 


. estimate of the extent to W 


The fourth conclusion, that “only limited and specialized demands 
may be made on the inventory technique," also shows a lack of ac- 
ith personality testing. When the data afforded by such 
n a multidimensional fashion from various 
points of reference, the limitations the reviewers speak of will be largely 
eliminated. This is not to suggest that a case study should be limited 
to the use of a personality inventory—in fact, we recommend the oppo- 
site; but it is to suggest that the data obtained from such inventories be 
adequately utilized. Properly interpreted, they can become a valuable 
aid to the psychological diagnostician and to the psychologically oriented 


quaintance w 
measures are examined i 


ith the attempts of the military psychologists 
for predictions of success in training 
low correlations yielded by the pro- 
in the nature of things, have 
intment arises out of the fact 


Our disappointment w 
to use personality questionnaires 
and in service is not caused by the 
cedures used. Such procedures must, 
yielded such correlations. Our disappo 
that those procedures and no others were use 
yielded are being taken seriously. 


The only one of the so-called experiments reviewed by Ellis and Con- 


rad of which we have personal knowledge is the one concerning the 


Humm-Wadsworth Temperament Scale as applied to pilots in training. 
When this project was begun, it was our understanding that it would 
be carried on by individuals who were familiar with the Scale and that 
our collaboration would be accepted. When we learned that the work 
was proceeding along lines that our own experience had already demon- 
strated to be futile, we protested strongly and finally our participation 


was again accepted—but, as it turned out, only on the surface. We were 
furnished the data of age, raw scores, and no-count of 385 subjects, and 
were requested to categorize them according to suitability or unsuit- 
ability of each individual's temperament in such a way that a simple 
statistical validation against success in flying training could be secured. 
This we could not do, since the factors entering into such success are 
complex, and temperament—complex as it is itself—is only one group 
of these factors. However, we did submit an interpretation of each Scale 
from the scores sent us and gave an estimate of integration, that is, an 
hich the temperamental characteristics 
found were likely to be under control or out of control. Since some of 
the temperamental characteristics would be undesirable if out of con- 
trol, and since some of them would be stimulated to manifest themselves 
in the training-course situation, but would not necessarily interfere 


with learning to fly an airplane, a simple correlation of our estimates 


against pass-fail in the course would be inevitably low. 


d and that the results they ` 
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Again, it was our understanding that we would be permitted to sce 
the data gathered in the experiment and to make our own analyses of 
them. Whether the conditions under which these data were gathered 
could have been adequately controlled to provide the conditions called 
for in an experiment, or whether the cases were or could be followed 
up long enough or studied in sufficient detail to provide meaningful 
findings, we do not know. We do know, from our own experiences, that 
the attempts to oversimplify the problem of ascertaining the contribu- 
tion of measures of temperament to predictions of success in learning 
to fly or in adapting to service in the Air Force were doomed to failure 
from the start. The unsatisfactory results obtained are a reflection, not 
upon the measures used, but upon the awkwardness of the attempts to 
use them. 

The senior author of this note has long wished an opportunity to 
state his position with regard to research where one student undertakes 
acritique of another student’s work. His protests against oversimplified 
and ill-considered studies of the Humm-Wadsworth Temperament 
Scale have frequently been interpreted as attempts to inhibit such re- 
search, and his requests to be informed of studies contemplated and of 
their progress have been interpreted as attempts to bias the results. 
This is not, we believe, the case. We believe that scientific research 
should have a free hand. We ask to be informed of studies contemplated 
and to be kept informed of their progress only so that such suggestions 
as we have to make may be given in time to be useful. We do not ask 
that any findings be suppressed, but only that such comments as we 
have to make be considered before, rather than after, publication, and 
that our comments be published with the report of the study. 

We realize that this represents a departure from accepted practice, 
but we believe such a departure would improve the situation with re- 
gard to the publication of research. It would remove from an over- 
crowded literature some futile controversy, leaving room for discussion 
of fundamental differences; and it would enable the busy reader to con- 
sider at one time both sides of such discussions. 


Received February 11, 1949. 
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REPLY-TO THE HUMMS' "NOTES ON 'THE 
VALIDITY OF PERSONALITY INVENTORIES 
IN MILITARY PRACTICE " 


HERBERT S. CONRAD 

Federal Security Agency, Office of Education 

AND 

ALBERT ELLIS 

The New Jersey State Hospital at Greystone Park 
The Humms! have given an interesting account of their philos- 
ophy of personality-test validation. Some elements of this philosophy 
It to fulfill in a wartime military 


would, unfortunately, be very difficu 
nality questionnaires 


situation. If, for example, the data from perso 
“examined in a multidimensional fashion from various 


must always be 
points of reference," then the usefulness of such questionnaires for 
wartime military screening—when speed is generally at an extreme 
premium—must indeed be problematical. 
We agree with the Humms regarding the advantages of question- 
naires which are broadly applicable to all sorts of subjects (although 
the question of comparable intelligence, reading ability, motivation, 
etc. of these subjects would seem to be of some relevance here). It was 
our conclusion, however, from a review of the literature, that personal- 
ity questionnaires seem more effective if “specially designed for the 
group to whom they are applied" (p. 421). The gain in effectiveness 
achieved by special design seemed sufficiently large to warrant atten- 


tion. Apparently the Humms' experience fails to confirm this conclu- 
sion. Additional empirical evidence is needed on this question. 

With regard to the Humms' second main point, concerning honesty 
of response to personality questionnaires, again we would welcome 
additional empirical evidence. The Humms are correct in suggesting 
that merely “urging the subjects to answer truthfully is not the ied 
tion to the problem—nor did we suggest this as the solution. Rather, 
we pointed out that the military situation. (not merely the directions 
accompanying the questionnaires) frequently appeared favorable ns 
reducing dishonesty, and that this fact might account, in part, for the 
higher validity of personality questionnaires in military practice. The 
Humms' statement that "response-bias can be measured and compen- 
sations for its effect can be devised” strikes a hopeful note—but 15 only 


Meagerly substantiated as yet- 
s on "The validity of personality 10- 


M. Note 
chol. Bull., 1949, 46, 303-306. 
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In their third point, the Humms mention that their own experience 
contradicts the suggestion by Altus and Bell that ‘‘more subtle ques- 
tions [than are contained in most personality inventories] would doubt- 
less be required" for personality-measurement of the more intelligent 
_and sophisticated adult (p. 404). This contradiction in the conclu- 
sions of experienced investigators again indicates the need for additional 
evidence. 

As to the fourth point, it seems to us one of the most assured and 
impregnable conclusions from the literature that "only limited and 
specialized demands may be made on the inventory technique" for the 
diagnosis of personality (p. 421). The Humms do not accept this con- 
clusion. Very likely some practitioners succeed better than others in 
interpreting the information from personality inventories, and perhaps 
some inventories prove more useful for this purpose than others. Our 
statement, however, was a general conclusion, based on the experience 
of various experimenters with many of the most widely used inventories, 
and not aimed at any particular inventory or individual. 

We see no general objection to the Humms' recommendation that 
the author of a personality inventory be consulted at all stages in an 
investigation involving the author's inventory. It seems clear, however, 
that if an inventory were in wide use, the author could not possibly find 
enough time to answer all inquiries. In the physical sciences and medi- 
cine, it is not generally necessary to call upon the author of a new pro- 
cedure or technique, before the procedure or technique may be properly 
applied. Some psychologists might argue that such a situation has 
already been reached in the field of personality measurement by the 
inventory technique. 

Finally, let us remind the reader that the Ellis-Conrad review? 
was not aimed against the Humm-Wadsworth scale nor indeed against 
any personality inventory. The review attempted, first, to determine 
"the reasons for the superior showing of the personality inventories in 
military practice" (p. 385); and second, to draw some general con- 
clusions applicable to the use and improvement of personality inven- 
tories in the civilian situation. It is these general conclusions which the 
Humms, principally on the basis of their experience with the Humm- 
Wadsworth Temperament Scale, have questioned. They imply that if 
their specialized techniques of standardization, case history comparison, 
and interpretation of personality inventories were universally followed, 


our general conclusions would not hold. The situation calls for addi- 
tional evidence. 


? A. ELLIS and H. S. CoNRAp. The validity of personality inventories in military 
practice. Psychol. Bull., 1948, 45, 385-426. 
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Selected works. Moscow: State Pub- 


Secnexov, I[vaN MICHAILOVICH]. 
1035. Pp; 


lishing House for Biological and Medical Literature, 
Xxxvi+291. 
This volume was prepared for foreign delegates to the XVth Inter- 
national Physiological Congress in Leningrad and Moscow in 1935. Its 
purpose is to make known to scientists outside of Russia the nature of 
Sechenov's contribution to science. The book includes a twenty-eight- 
page biography of Sechenov, written in English by Sechenov's pupil, 
M. N. Shaternikov, nine physiological papers on chemistry of the blood 
and on neural action, printed in the original German and published be- 
tween 1862 and 1901, and four papers, three of them quite long and all 
of them of special interest to psychologists, which were originally pub- 
lished in Russian from 1863 on and which were translated into English 
for this volume. 

Sechenov was born in 1829. Mosco 


under Glebov at Petersburg (1851). He we 
Tüller and Magnus, 


w was hisalma mater. He worked 
nt to Berlin (1856) to take 
and to do research with du 


lectures from Johannes N € 
Bois Reymond. At Vienna (1858) he came under the influence of the 
great physiologist, Carl Ludwig, who remained his lifelong friend. and 
mentor. Eleven of Ludwig's letters are reprinted in this Life. Next. 
Sechenov got to Heidelberg (1859) to work under Helmholtz and Bun- 
sen and to form a close friendship with Mendeleyev. After teaching in 
Petersburg, he worked with Claude Bernard in Paris (1862). He made 
visits to Munich and Graz (1863). In between times (1860-1870) he 
taught physiology at the Medico-Surgical Academy in Petersburg. In 


1870 he resigned over a controversy about the appointment of Mechni- 
six years. Then he was elected to the chair 


kov and went to Odessa for : c 
of physiology at Petersburg (1876), resigned to go to Moscow as ene 
lecturer (1888), was elected to p] e MATS at Moscow (1891), 
resigned on account of age (1901) and died in : "E. 

The reprinted papers in this volume represent Sechenov pue pue 
pal scientific interests. One of his research fields was the absorption o 
gases by liquids, especially of CO? by the blood. Sechenov m nee 
an improved gas pump which Ludwig immediately adopte in = a i 
ratory. Sechenov’s other chief interest was neural action, especially the 


inhibiti i xes by the a propriate center in the cortex—the 
aaa yore [qune d be eia American students of pum 
Ology used also to hear about the “Sechenov experiment ; you m ES 
on the upper cut end of the spinal cord and find that yon ean ne p ger 
elicit some of the reflexes that are localized lower down. 9ec mE ipea 
his publications along both of these lines about 1860, continue gas 


f neural inhibition up to 1901. 
dE 1 pA um lated into English is The re- 


The first of the Russian papers trans A 
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flexes of the brain, published in 1863. It argues we should recognize the 
dependence of thinking and the other intellectual functions on stimula- 
tion, and thus, supporting a mechanistic view of mind, it justifies our 
naming Sechenov a pioneer in Russian objective psychology. Pavlov 
himself names this monograph as the inspiration for his work on the 
conditioned reflexes. Said Sechenov, “All acts of the conscious and un- 
conscious life are reflexes.” Movement in a brain reflex may be in- 
hibited as is reflex movement in the cord, but psychical acts, he argued, 
are clearly initiated by stimulation. Nevertheless Sechenov did not 
carry his materialism to the bitter end, for he was a valiant defender of 
freedom of action and the spiritual values. He was a man of deep moral 
sense, human sympathy and constant concern with causes (like educa- 
tion for women). The Russian intellectuals liked his book, but the 
Petersburg Censorial Committee in 1866 condemned it as materialistic, 
forbade its sale and initiated court action against its author for the cor- 
ruption of morals. The Minister of Justice, however, in 1867 quashed 
the action and exonerated this enthusiastic, virtuous, beneficent mecha- 
nist. 

The paper called Who must investigate the problems of psychology, and 
how? was published about 1870 and provides a constitution for the re- 
flexology of Pavlov and Bekhterev. The anonymous editors of this vol- 
ume say that the schools of Pavlov and Wedensky (‘‘Wedensky inhi- 
bition") are based upon Sechenov's views. Bekhterev they do not men- 
tion. Both Pavlov and Bekhterev were educated in the Petersburg 
atmosphere, which Sechenov had created before he went to Moscow in 
1888. Certainly Pavlov's views about reflexes, the “psychic juice" 
(gastric) and psychology within physiology make more sense when 
viewed against the background of Sechenov. 

The last item in this volume is the English translation of a small book 
which had already had a second edition in Russian. It is called The 
elements of thought, and its eight chapters formulate Sechenov's mature 
mechanistic views about the nature of intellectual mental processes. 

None of these writings of Sechenov's refers by more than casual 
reference to the work of other authors. Sechenov lectured in the grand 
tradition of the nineteenth century savant, students swarming to hear 
and the public eager to read the pronouncements of the great man. Hc 
was ready with praise for Helmholtz and other distinguished men of his 
time, but he did not provide foot-notes referring to their published 
writings. In spite of his close contact with German and French physiol- 
ogists and the frequent personal letters from Ludwig (who was first at 
Vienna and then at Leipzig), the modern reader feels that Petersburg 
and Moscow were pretty far off from Berlin and Paris. (That was not, 
of course, true of the Petersburg Academy which entertained so many 
western savants.) Nor did the editors of this volume provide the means 
for integrating its content with other scientific work. They do not give 


a 
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P. original places and dates of publication and my approximate fixing 
of dates comes only from the somewhat unsystematic biography that 


starts the volume off. 
. We could do well with more knowledge of the history of Russian 
science. This book, casual as it is in date and detail, helps to give per- 


spective to the later scene in Russian psychology. 
Epwin G. BORING. 


Harvard University. 

BLux, Mitton L. Industrial "psychology and its social foundations. New 
York: Harper, 1949. Pp. xi 4-518. 
Conservative readers of Blum's 7 g 

consider it audacious if not downright revolutionary in its organization 


and content. Unlike all of his predecessors within the last decade, Blum 


has emphasized the achievement of human work efficiency more via the 
vehicle of average employee goals-motivation-and-realization than 
through the more orthodox vehicles of selection and placement testing 
d to two chapters in 


and merit rating. Psychological testing is relegate 
] merit rating is completely ignored except 


the latter half of the book, anc 
for casual mention in a few scattered paragraphs. ; 
This reversal of emphasis—or, perhaps, more accurately, broadening 
of emphasis—from the traditional approach of Viteles, Moore, and 
Tiffin immediately marks this volume as either a novelty in its field or a 
first milestone of a new trend in industrial psychology. : 
In his preface the author states that much of his material is con- 
troversial but he insists that "the social bases of industrial psychology 
must be understood if the promotion of more harmonious employer- 
employee relations in a democratic society is to be given realism and 
meaning." Perhaps his viewpoint is best stated in his introductory 


chapter. 
If industrial psycholo. 


ndustrial Psychology are likely to 


gy is to be distinguishable from “scientific” manage- 
ment and other systems which may border on its field, it must boldly expound 
the view that democratic principles must be applied to industry. Its ue hou 
be the satisfaction of man, not of any one man to the died o! smem a 
of one group over any other. Men must be free to express their cd o eu 
goals, produce, and develop as secure individuals. These and other freedom * 
possible in a democracy and are also possible in an industrial system opera ing 
in a democratic fashion. Efficiency then follows as a necessary accompanimen 
(p. 1). l 
While the reader may wonder how the author proposes to me 
such a statement, he is likely to conclude from the ecg E 


itself that nothing radical is intended. In fact, in the same i 
chapter, he clarifies his conception of the role of the industrial psycholo- 


gist as being a thoroughly conventional and prastas o — aiita 
The psychologist employed in industry has a varied job an 1 
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His main function is to render service to the employer, to the employee, and to 
the various technical specialists in the organization (p. 3). 


After a brief review of the work of psychologists in industry, Blum 
devotes the 36-page second chapter to the "Hawthorne Studies” of the 
Western Electric Company; these studies he considers as marking the 
“coming of age“ of industrial psychology as an applied science. His next 
three chapters (84 pages) deal with the measurement, study, and use of 
the attitudes of employees ("Measuring attitudes in business and in- 
dustry,” “Job satisfaction," and "Industrial morale.") The treatment 
of attitudes is interesting, provocative, and well documented. 

A chapter on incentives is relatively conventional except to empha- 
size that financial incentives have been much overemphasized to the 
inadequate utilization of other psychological incentives. Job training is 
handled well but briefly and a chapter entitled ‘Time and motion 
studies" is treated competently in the conventional manner except that 
this chapter becomes entertaining reading at times when the author 
pauses to poke fun at some of the short-comings of efficiency studies. 
Most valuable is his treatment of "breaking resistance” to change not 
only in introducing more efficient methods but in forwarding other 
worthy changes. 

Blum's treatment of fatigue is not comparable to that of other avail- 
able psychological treatises, but his chapter on "Work environment” is 
a valuable contribution. An excellent chapter on accidents is followed 
by a highly useful chapter on leadership which recommends use of 
Jenkins' nominating technique in industry. The next two chapters, 
"Industrial warfare" and "Unemployment," are frank recognitions of 
stark psychological realities which academic psychologists previously 
have shown a disposition to dodge or ignore. Blum's viewpoint on in- 
dustrial warfare is the modern legal one: 


In the present stage of labor-management relations, collective bargaining 
seems to offer the best solution. If collective bargaining is to be sincere and 


mature, both parties must have relatively equal power; neither one must have 
so much as to force the other into submission (p. 341). 


Blum is likely to be criticized for not mentioning the problem of re- 
sponsibility when he discusses power; it seems reasonable that he de- 
serves such criticism. However his resume of the conclusions of Golden 
and Ruttenberg should serve as a helpful stimulant to inventive think- 
ing in this field, and certainly all industrial psychologists should read 
this chapter. 

The two chapters on testing provide a skimpy treatment by tra- 
ditional standards, but will be consistent with the current trend to intro- 
duce a separate course on personnel testing in the typical psychology 
curriculum. Highly useful introductions (not present in publications 
since the 1942 text of Moore) are the last three chapters on consumer 
research, advertising, and selling. A last chapter on summary and in- 


BOOK REVIEWS 313 


tegration tie together some loose topics and provide the student and 
practitioner alike with a somewhat broader perspective of the problems 
of industrial psychology than can be obtained from most competing 


books. 
WILLARD A. KERR. 


Illinois Institute of Technology. 


HOLLINGWORTH, Harry L. Psychology and ethics: A study of the sense of 
obligation. New York: Ronald Press, 1949. Pp. ix+239. $3.50. 


_ The theme of this provocative and highly readable work appears in 
its two opening sentences: “The key to morals or ethics is the feeling of 


obligation, the recognition of imperatives in thought and action. It is 
the sense of obligation that transforms mere behavior into conduct.” 
Evidently stimulated by Kóhler's treatment of requiredness, Holling- 
worth distinguishes ten primary categories of "oughtness" to which 
people respond, viz., inference, completion, beauty, welfare, utility, 
duty, safety, custom, justice, and (somewhat secondarily) legislation. 
All of these varieties of should designated procedures for reducing dis- 
tress—the notion of the moral, good, or right properly embraces the 
entire ten meanings. 
The author then presents two forms 
scored in accordance with the capacity | 
propositions in agreement with a key derive 
experts; e.g., "This axe ought to have a lon. 
flects the Utility judgment, while "A man oug 


of a 50-item Ethical Insight test 
of the subject to classify the 
d from college students and 
ger handle” normally re- 
ht to tell the truth regard- 
less of consequences" typically illustrates a Duty reaction. The medians 
increase steadily until early college years; post-Ph. D. specialists barely 
difler from freshmen in this measure of discrimination. | 

A novel section rich in clever writing considers the various schools of 
psychology and their power to explain oughtness satisfactorily. Struc- 
turalism and behaviorism are quickly laughed away, but Gestalt theory 
and the hormic approach are criticized with inasi 5) bs Ne 
jected as limited principles unable to account fully E equate ms i 
ten basic categories of "facts." As one would expect ie a pas ene 
dent of the American Psychological Association, he offers Ly : his 
own familiar triadic system, (1) all motives are distresses, (2) al ie 
learning is “cue reduction,” and (3) control is a function of sepe his 
simple pattern is then used to interpret extensively oh wer 
of obligation sampled by his Ethical Insight Test. 7o ingworth i 
this brilliantly and impressively, notably in his ecl t 
version of the sense of injustice as centered pecu € à 
mum of ki ic asymmet i ns lik ; 
The principal emergent suggestion is that optimal vitio adi 
Which converge in some “final common path” after weigh ing 


types of o e appear in any con 
ught as these appe on the Procrustean tec 


No work resting so heavily 


hnique can fail to 
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raise a host of debatable issues, and this one is peculiarly vulnerable 
despite the permanent importance of the topics and the admirable in- 
genuity manifest in their exposition. Either there are less than ten basic 
imperatives in life's affairs or there are more; major values in a pluralistic 
world are probably not confined to this round-numbered schema. Jus- 
tice, duty, and welfare as “drives” seem phenomenally more alike than 
different so far as distinguishable pressures behind actual choice deci- 
sions and subsequent performance are concerned. One need not be a 
doctrinaire Gestaltist to hold that coercive or inevitable inference is one 
manifestation of the completion-tendency in all behavior (or vice versa) 
and that homeostatic tensions to "finish" an indicated total pervade 
most of the special classificatory principles Hollingworth apparently 
accepts as irreducibles. His apparent lack of concern with unraveling 
the complex motivation of any particular imperative proposition opens 
him to the charge of abstract formalism; his preference for "education 
rather than social reform" seems like an inconsistent denial of his own 
oft-affirmed faith in “the natural continuum"; and his claim that 
socialistic philosophies are psychologically unsound (p. 163) is paradoxi- 
cal in view of the obviously powerful mass oughts supporting the trend 
toward democratic socialism in Britain and elsewhere in the contempo- 
rary world. More seriously, one wonders about the anti-hierarchical step 
of making “conscience” merely one ethical category among many when 
it enters methodologically unrecognized as the over-all response "'repre- 
senting action based on full insight and exhibiting full scope” (p. 169). 
But these are minor limitations in an exceptionally original, stimu- 
lating, and rewarding treatise. Its down-to-earth practical quality 
should appeal to teachers and counselors generally, while philosophers, 
social scientists, and psychologists could profitably build many exten- 
sive monographs on the numerous research hints hidden in the rich con- 
tent of this pioneering volume. 


GEORGE W. HARTMANN. 
Teachers College, Columbia University. 


VAUGHAN, WAYLAND F. Sociol psychology: The science and the art of 
living together. New York: Odyssey Press, 1948, Pp. 956. 


"....I have selected topics which seem to be of primary concern 


to the student who is seeking a satisfactory personal orientation amidst 
the social chaos of our time. The central theme is based on the convic- 
tion that human beings can achieve happiness in their social relations 
by discovering just how we've gone about it to make one another so 
miserable. Surely we can profit by our mistakes (p. vii)." This quo- 
tation indicates the tone of theentire volume. It is projected at a homely 
level and is amply stocked with interesting quotations (between 25 and 
30 percent of the volume), a large proportion of them coming from news- 
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papers and the "better" magazines. Its twenty-one chapters deal with 
such standard topics as "The Individual in Social Situations," ‘‘Con- 
vention," "Propaganda," “Prejudice,” "The Family," and “The 
Church," and, on the other hand, with such special topics as ‘‘Values 
and the Art of Living," “The Unemployment of Intelligence," and “The 
Socialization of Intelligence." It is characterized by a willingness to 
make judgments concerning matters of state, religion and person 
most of which are liberal judgments. The book is definitely not aca. 
demic and is really of a type which the nineteenth century called ‘moral 
philosophy,” a free-ranging, evaluative (though not necessarily opin- 
ionated) presentation of many aspects of inter-personal relationships. 
Written in a rather personalized style and with a desirable tinge of 
muck-raking (or is it iconoclasm?), it will make interesting reading 


for beginning students. 
However, the question arises as to whether it is written for be- 
In most universities social psychology follows at 
"chology, so that some con- 


ginning students. 
least one introductory course in general psy 
tinuity can be expected. While this continuity could be methodological 
or material in nature, Vaughan avails himself of neither of these possi- 
ts itself with popularized explanations 
tcriticizes. As an illustration, in dis- 


bilities. Instead the text conten 
llective mind, the following explana- 


not much above the level of those i 
posing of Le Bon's theory of the co e foll. [ 

"The crowd situation brings out 
primitive passions. There 


tion is offered of crowd behavior 

'human nature in the raw' by releasing our 

is no need to resort to the concept of a collective mind to account for 

our barbarous conduct under such circumstances (p. 620)." This is 
wd behavior though ad- 

Aside from the 


hardly an acceptable scientific account of cro 
mittedly it is one which is attractive to the layman. Asid 
y is not more sophisticated as a 


question of whether Le Bon's theory S 
conceptual tool, it is disturbing to think that students might attempt 
to use such an explanation in meeting the complex problems of social life 
Or the volume does contain many suggestions for 
1 the other hand, pe sir ue de 


handling everyday, interpersonal go w 
scribed as wise and undoubtedly effective. — À 
: ri , naive 

i ok was not W ritten for psycho ogically 
Assuming that the bo LE 


students, s ing has obviously gone wrong. There a 
2 ok to mind. One is that the author believes at the 
is, in fact, no continuity between general or systematic psycho! ogy 2 
Social psychology. This is unlikely for he has exe win 
general psychology and throughout the present vo (m = s m 
of the concepts and results of investigators having a ipana y SyS E 
atic bias, e.g., the Yale and Harvard groups. " is rea ya gor i» 
of how the material is usec- The difficulty results, m part, A 
odern social science 


belief that in order to get across the viewpoint of m 
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it is necessary to use the language and notions of the layman, that some- 
how or other any attempt at systematization will defeat itself. There 
seems little warrant for such an attitude. To accept the aim of edu- 
cating the nonprofessional student does not mean that it must be done 
on his terms, for such an approach can only serve to maintain the very 
barrier one is attempting to bridge. It is probably the non-systematic 
orientation of the unsophisticated person which militates against a 
rational approach to social issues, so that it would seem wiser to insist 
that the student at least /ry to acquire the conceptual tools of social 
psychology. Certainly one of the major outcomes of recent research 
has been to demonstrate how inadequate every day "thought-ways" 
are for the analysis and remedy of social problems. f 

. But these are, in a sense, peripheral issues, for what we have been 
discussing are really stylistic decisions; there are more basic considera- 
tions which account for the form of the volume. We cannot escape the 
feeling that this decision to ignore the formal, conceptual processes of 
science for an emphasis on the “here and now" springs from a strong 
sense of urgency. Social scientists today are very sensitive to the role 
which they may and can play as citizens, and there are strong pressures 
to jump in and do something. This responsibility seems to be the heart 
of Vaughan's difficulty. He has acutely perceived the problems which 
face us as members of society, heard the request for answers and has 
attempted to provide them. Unfortunately, in many instances, different 
answers would be just as defensible on the basis of present data. What 
happens then is that personal hunches or guesses are often advanced 
as if they were the answers to the problem which has arisen. The 
reader has not been adequately prepared to recognize these as only 
one set of many possible proposals and correctness by authority is thus 
substituted for a systematic account. Many of the conclusions (we may 
say) seem correct to us but this simply means that we are both on the 
side of the angels. Moreover, a scientific account requires more than 
such mutual agreement, even on conclusions. Thus, while the volume's 
stress on the need to recognize values and objectives in scientific re- 
search is salutary, it commits the error of advancing a specific set of 
values rather than providing some means of analyzing various normative 
proposals. 

These remarks concerning the need to train analytic rather than 
convinced readers seem to justify the following concluding statements. 
It is probably more efficient in the long run to stress the logic of re- 
search, the need for more information, the nature of theory and the de- 
sirability of caution. It is perhaps more damaging to supply inadequate 
tools or a false sense of knowledge to students than it is to hold in check 
a desire to meet one's social responsibilities. 


i RICHARD A. LITTMAN. 
University of Oregon. 
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PETERSON, H. A., Manz 
4 . A., Manzorr, S. S., AND BAYLE 7 
i chology. New York: Macmillan, 1948. Pp. S reip c uA 
wie hgh i poo in educational psychology is clearly a superior 
hogar pice - makes a very useful addition to the literature of the 
LEE Le written, thoroughly documented, and up-to-date in its 
kA ee gal nd in its use of research material. A number of features of 
M eur i y particularly meritorious. These include (1) lists of films at 
pee um which should be valuable for instructional purposes; 
Lade ace ap hasis upon social learning and group participation than is 
erra und in such texts; (3) stenographic reports of actual classroom 
r g eei qe which can be studied by the student and evaluated 
m — : Henn psychological knowledge; and (4) excellent study ma- 
ew ists of readings in connection with the individual chapters. 
dut ;ook devotes chapters to all the usual areas of educational psy- 
ogy, namely, growth and development, learning, personality ad- 
the general field of learning re- 


eee and evaluation. However, 
ei : : i 
ves the major share of attention, with measurement and evaluation 


T ranking second in terms of allotted space. In the opinion of the 
tal pe er, hardly enough attention is accorded the general areas of men- 
al hygiene and personality adjustment. The one short chapter dealing 
n: this subject is excellent, but a more extended treatment of class- 
Polus problems in this field would probably have added materially to the 
h e of the book. The added material on mental hygiene could perhaps 

ave replaced some of the less valuable discussions of retention in learn- 


ing, fatigue, diurnal variations in efficiency, etc. 

is a textbook also seems to suffer somewhat from a weakness which 

ie. ually universal among books in educational psychology. This 

wi coming has to do with its partial failure to provide the student 
h an integrated set of pr hich can be used to ex- 


lai inciples or theory W 
plain behavior in its many manifestations. A student who masters a 


book of this type will tend to possess many individual facts, and know 
the results of numerous experiments, but will scarcely have a unified 
Psychological "frame of reference” which is clearly understood, and 

ations faced by teachers. 


which can be applied in the many diverse situ 
hich can be directed toward most 


This criticism, however, is one W 

field of psychology- Perhaps the science of psy- 

hed the stage of maturity which would make it 
ories or generalizations 


far in setting forth the 
neering the learning and 


hose engaged in engi 

It would seem, however, that textbook writers of 
effort in the direction of 

here this is possible. 


ae ustment process. 
ee psychologies s 
viding basic theory in th 
T eterson's Educational Psychology covers in 
that ral often treated in general psychology 
e general treatment of the subject 1s suc 
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particularly useful in those courses in educational psychology which do 
not require general psychology as a prerequisite. It is, however, clearly 
a text of such thoroughness and breadth that it undoubtedly will be 
widely used in more advanced educational psychology classes. 

On the whole, Peterson and his collaborators are to be commended 
for having written a most readable, practical, and authoritative text 
which will exert a very wholesome influence upon teacher training pro- 
grams. 

Grenn M. BLAIR. 

University of Illinois. 


BRAY, CHARLES W. Psychology and military proficiency. Princeton: 
Princeton Univ. Press, 1948. Pp. xviii 4-242. 


This book should be required reading for Army, Navy, and Air Force 
officers. In summarizing the history of the Applied Psychology Panel 
of the National Defense Research Committee, Professor Bray has done 
a service for psychology and for the military. He shows how it came 
about that civilian psychologists were enabled to render notable aid in 
the solving of many of the human problems of the military during World 
War Il. Many of these problems were completely new to psychology 
and it is a remarkable tribute to the ingenuity of the Panel's investiga- 
tors thay they succeeded in adapting the techniques of experimental 
psychology to this use. 

Professor Bray brings order to what has been a confusing picture. He 
delineates clearly the relationship of the Applied Psychology Panel 
to other wartime research agencies, to the Panel's 23 contractors, and 
to the various military agencies. 

The results of the Applied Psychology Panel's work have been re 
ported in 513 separate documents. Too frequently the author is forced 
to note that a particular report is "not available to the public." While 
it is understandable that some of the investigations may possibly con- 
tinue to involve national security, it is to be hoped that declassification 
is not being delayed only because of lethargy on the part of the responsi- 
ble military agents. While Bray's descriptions of the various researc 
projects are necessarily brief, he does give the reader enough of a taste 
to whet the appetite. To find that the original research report is not 
available is most frustrating. 

The program of the Applied Psychology Panel was carefully planned, 
planned in such a way that optimal use would be made of the limite 
personnel resources available to the Panel. This personnel limitation 

was more serious than is generally known. A count of the people men- 
tioned by Bray as having been engaged in Panel work discloses that 
fewer than 200 individuals were involved. Because of this limitation 
the Panel was forced to reject many service requests for help. Early 
in the work, the major emphasis was on classification and training. This 
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em i i 

Cs: on eit gradually until, at war's end, a large portion of the 

of c bani was concentrated on research on design and operation 

at t = on achievement and proficiency tests. 

disney n : ray, in the final chapter, “The Future of Military Psy- 

pad old hs a € perfectly clear what the role is that psychology should 

uly: ike ursi in the military services. He summarizes most effec- 

eei : nditions under which psychology would be able to make its 
st contribution to the military and he challenges the military to 


provide those conditions. 
GLEN FINCH. 


Headquarters, U. S. Air Force. 


Eysenck, H. J. Dimensions of personality. 

1948. Pp. xi4-291. $5.00. 
ate of knowledge, any large-scale investigation of 
tant event. This book is an account of the work 
jill Hill Emergency Hospital by a number of 
rking together. The institution is 
cho because of neurotic 


New York: Macmillan, 


In our present st 
personality is an impor 
that has been done at M 
psychiatrists and psychologists wo! 
one which served chiefly men and women v 
characteristics were unable to adjust to army life. Since most of the 
specific studies have already been reported in journals, Eysenck dote 
not attempt to describe them all in detail but tries rather to bring 
them together into a coherent organization and fit them into the con- 
arch on personality. He does this well, analyzing prob- 
ancies from study to study. His definitions of 


fying what he and his co-workers mean by 
y helpful in this connection. 


he first task was to identify the principal 
be measured. For this pur- 
trists on 39 traits. Factor 
tified a general factor they 
neral neuroticism. The 
anxious, inhibited neu- 
it dysthymia-hysteria 


pose, 700 patients were ra 
analysis of correlations betwee 
called P, general personality orga 
second factor, bipolar in nature, 
rotics from hysterical, asocial ones. 
or introversion-extraversion Freudian 
Sense). These two factors together or 26 per cent of the 
variance. Two other bipolar factors, representing eight per cent and 
Piel per cent of the variance respectively, were considered to be of little 
undamental importance. 
wä Progi here on the research activity con: 
af ih in which the two basic dimensions 
«den her factors. A very catholic attitud 
an "d activity. Physical and physiological m 
kind personality questionnaires, laboratory 
5, and projective or expressive technique 


ploration of the 
a wide variety 
haracterize 

ability tests 
f several 


sisted of an ex 
are related to 
e seems to have € 
easurements, 
performances o 
s were all tried. 
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A few strikingly significant relationships emerged. General wy Él 
cism seems to be related to a significant extent to poor scores 2: dou. 
vision tests and to body-sway measures of primary suggesti ee ot 
Dysthymics are clearly differentiated from hysterics in some ren 
their level of aspiration scores. There are many other suggestive -— tha 
and promising techniques, such as the Ranking Rorschach Test a ien 
analysis of responses to humor of various sorts. In the last c pot 
the author brings together the bits of information into a compan 
picture, but there are really not enough fairly certain facts to make 

onvincing. 

The soxalcosesen in the work are those which would be likely so -— 
up in a situation where many individual investigators branch E n 
separate paths without much central control over formulation o fea 
lems and methods. Although it is stated that 10,000 subjects wens 
involved in the whole undertaking, the number of cases in many a 
dividual studies is entirely inadequate to settle the issues being inves ti 
gated. Ways of showing relationship between the basic dimensions a = 
other traits vary from one study to another, and many of them prese d 
data in an awkward form which makes interpretation difficult ge 
comparison with other studies impossible. One wonders wh ythe weigh te 
Scores for neuroticism and introversion which were used in ploton 
Figs. 4 and 5 could not have been used in all the separate studies an 
correlated with measurements of the traits being investigated. 

The most serious limitation of the work arises from the non-repe^ 
sentativeness of the population, especially that upon which the initia 
factor analysis is based. Patients at this hopsital were not even repre- 
sentative of neurotics in general, and men and women appear to m. 
been differently selected. The author, of course, recognizes this di 
culty, and it is to his credit that he describes the experimental aor 
in some detail and brings in comparison data from normal population® 
wherever possible. These comparison groups, however, are very ppm 
and often very small (e.g., twenty normal controls were used in th 
exercise response study, p. 103). Sometimes no information is given a$ 
to how they were chosen (e.g., Ranking Rorschach study, P- a 

In a largely uncharted area like personality research it would 
unfair to judge a preliminary voyage of exploration by ideal standards, 
Limited data are certainly of a great deal more value than no data = 
all. The main task accomplished, as the author sums it up, is that ~ 
“isolating two personality dimensions and discovering a series of = 
which enable us to perform quantitative investigations along ES is 
two dimensions." This alone is a considerable accomplishment. rich 

as a framework for further quantitative personality study and Ms ad 
mine of ideas for laboratory and test methods by which one er is 
ceed that the book will be most useful. It is in no sense the las 

on the subjects it discusses. 


i Leona E. TYLER- 
University of Oregon. 
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WmurrE, Ropert W. The abnormal personality, a textbook. New York: 

Ronald Press, 1948. Pp. x+613. $5.00. 

In The Abnormal Personality, Robert W. White contributes enor- 
mously to the need in teaching for an integration of personality theory 
with the traditional precepts of psychopathology. It is apparent that 
the author is well grounded in sound, experimental psychology, and ex- 
perienced in actual clinical practice, and that for some time he has been 
preoccupied with the problem of an amalgamation of these in the form 
of a theoretical structure which would be useful in teaching. Perhaps 
more important than these characteristics, however, are two which the 
reviewer would like especially to emphasize. Manifest throughout the 
book, these are qualities which, in combination, seem today to be ex- 
ceedingly rare: scholarship and good writing. This reviewer would like 


to discuss each in turn. 

The reader of The Abnormal Personality is aware soon enough that 
White's position is very clear with reference to much of the theoretical 
controversy regarding personality, involving as it does psychoanalysis, 


Kraepelinian psychiatry, present-day emphasis on clinical psychology, 
"organic" vs. functional etiology, psychosomatic, medicine, etc. He 
seems to have assumed it as his special task to disentangle from this 
complex the threads of significance, with the possibility that these might 
form a pattern which the student in the course 1n Abnormal might 
assimilate. In doing this, White has been almost compulsive in his use 
of basic literature. It is apparent that the author has read widely. But 
his writing indicates that he has read not only widely but carefully, and 
that he has been selective in his references. His study has been inten- 
sive as well as extensive. Because of this feature alone, one might at- 
tribute scholarship to his book. But a factor which counterbalances and 
yet supports this is the author's willingness, when facts are scarce, to 
put his best foot forward, humbly but with common sense, and sen the 
other foot firmly on the ground. It seems to the reviewer € we a 
here an exemplification of psychological study—examination of re- 


inati i incorporation 
search findings, combination wit n p 
with most reasonable durent e t T— 

In regard to the matter of goo 

easily aang from its first to its last page The bere m er 
made a real effort to use ordinary language to establish panen ys 
which the old-fashioned jargon of traditional psychopat D ogy d 
ably confuses with its Hellenicized mumbojumbo. It som erbe à 
to the author, clear speech and, what is most important, clear thi g 
are of paramount consideration. 


; , 
In certain areas the reviewer is not in agreement with the author's 


iti i ion i d to psychodynamic 
Position. (1) With a healthy orientation in regard to Ps} y 
probabilitice White seems to take Sheldon priy tsi pret 
a seat reserved for the geneticists if they should happ M 
with anything. (2) Goldstein's position with reference fo er n 
Possibilities is well presented provisionally, but it seems to 
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adopted generally with reserve. (3) White is at his best when discussing 
neurosis and its dynamics; regarding psychosis he seems to take much 
more refuge in the authorities. 

Essentially, however, this is an approach to the subject-matter of 
"abnormal psychology" on the one hand, and "personality" on the 
other—areas traditionally disparate, but actually synonymous—for 
which the reviewer, in particular, has been looking for a long time. 

T. W. RICHARDS. 

Northwestern University. 


MacKixNoN, Donatp W., AND HENLE, MARY. Experimental studies in 


psychodynamics: A laboratory manual. Cambridge: Harvard Univ. 
Press, 1948. Pp. ix+177. 


The disparity between the amount of speculation in the field of 
human motivation and personality, and the quantity of experimental 
techniques and evidence in this same area has been disconcerting for 
too many years. Therefore it is rather pleasent to note the advent of a 
laboratory manual which has as its modest purpose the presentation 
of 14 experimental designs to fulfill the laboratory needs of a semester 
course in the experimental psychology of personality. Such a course, 
no matter how limited the implications of the experiments may be, will 
have the highly redeeming feature of focussing the attention of students 
of personality upon experimental problems and will force them to think 
in terms of experimental design. In the ability of this manual to con- 
tribute to such a course lies its chief merit. 

In addition to its mundane pedagogical aspects, this book has cer- 
tain theoretical implications. If one were to accept uncritically the 
panegyrical descriptions with which some theorists characterize their 
theories, one might be seriously misled as to the degree of maturity the 
science of psychology has achieved. Fortunately, we can examine what 
psychologists do in addition to what they say they do. Since the authors 
of the experimental manual are kindly disposed towards Lewin's theory: 
the experiments, many of which were initially reported by students of 
Lewin in German periodicals, are considered within the framework 0 
topological and vector psychology and consequently an opportunity to 
examine Lewin's theory on the experimental level is gained. 

A rather interesting and significant methodological point that i$ 
repeatedly raised by the authors is that the experimental behavior 
must “be understood in terms of the psychological field in which it oc- 
curred” and that "it is insufficient to be able to specify the characteristics 
of the physical situation in which the subject is placed and to correlate 
these with the experience and behavior of the subject." They admit that 

it is a rather difficult task to determine the subject's psychologic 
feld but nevertheless assert that it is possible. Clues as to the nature 
of the subject's psychological field can be gained from the observation 


0 
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be ME s behavior and his verbal reports. But “these clues must 
gained subtly and examined critically." One wonders about the 


fruit i 
tfulness of such an evasive construct. There is nothing methodo- 
heoretical constructs as reflecting 


E etc incorrect in interpreting t 

nw weis ined] experience, but by so doing one does not escape 

eset E ge of objective, operational usage of concepts. It has 

Macki o the reviewer that the system of constructs used by Drs. 
Kinnon and Henle has been somewhat more phenomenologically 


oe than theoretically and empirically consistent. 
e manual was used by the authors with students who had some 


— with experimental procedures in psychology. One might 
enon gate that training in clinical psychology would be advisable since in 
ae inguin a subject should be chosen who is lively, active and 
A t gen one who takes pride in what he does, who likes to do things 

ell" while another experiment requires à subject “who is sensitive and 


creative, though not at all a prima donna." 


d for a four-hour laboratory period. 


wi Each experiment is designe 
ith the exception of a few simple pieces of apparatus the laboratory 
se is supplied with the manual. A booklet 


material for the entire cour 
for the instructor has been pr 
those who find themselves in 

this manual can serve as perimental course in "psycho- 
dynamics." The manual should be of interest to anyone who teaches 
a course in general experimental psychology. Several of the experiments 


provide helpful suggestions. For 
t with "the field" approach, 


can be used to advantage in such à course. 
Howanp H. KENDLER. 
New York University. 
Ewen, Jony, H. Mental health. Baltimore: Williams & Wilkins, 1947. 


Pp. 270. 

This volume, written by an English psychiatrist, is intended as "a 
practical guide to the disorders of the mind." Its title Mental Health 
deals neithe: inciples governing the 
oblems of mental 


is quite misleading since it 
normal personality nor, tO any great extent, with pr 
the author has given u$; in essence, another 
ight of the ever grow- 


hygiene. Asa matter of fact, I 
volume of familiar descriptive psychiatry. ) 
s in the e and behavior 

fronted with a 


ue array of publication e 
normalities the reader may justifiably ask, : 
id here that has not been said before or what 


"ia ee what is being sa! hat | 
iewpoints and concepts are being introduced? The present volume 
meets both questions in a very unsatisfactory fashion. : 
E... author has intend pendium 
of i ee Si to the field ! 
bese reviewer, however, his only 
vity. In 270 pages he has not 


mental diseas 


ed this bo L 
1d of psychological medicine. In the opinion 
achievemen 


dealt with t 
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atology, classification and description of the various mental disorders, 
but also managed to include a rather lengthy chapter dealing with Legal 
Acts governing mental disease in England as well as various certification 
procedures applied to medical officers in mental hospitals. Regarding 
the intended simplicity of conveying the subject matter, the author has 
failed conspicuously. Particularly in the discussions of general concepts 
the beginner will be confronted by a mass of highly ambiguous terms 
and concepts, undefined and stated in an uncritical and regrettably dog- 
matic form. The more sophisticated reader will be disturbed by unfor- 
tunate inaccuracies as well as by the author's rather archaic definition 
of "mind" and his apparent obliviousness of more modern concepts of 
personality mechanisms. The short glossary of technical terms is rather 
inadequate and shows a poor selection. : 
© While certain readers may find some use for the chapters dealing with 
the legal aspects of mental disease in England, the beginner interested 
in being introduced to the field of mental illness is likely to become more 
confused than educated. 
LUDWIG IMMERGLUCK. 
University of California, Berkeley. 


MENDEL, ALFRED, O. Personality in handwriting. New York: Stephen 
Daye Press, 1947. Pp. 375. 


Rudolf Arnheim, introducing Mendel’s book, presents graphology 
as a projective technique, while the author himself calls it “the psychol- 
ogy of man’s finest expressive gestures." In the opinion of the present 
reviewer projective and expressive methods should be differentiated 
from each other. Projective methods (Rorschach, TAT, doll-play etc.) 
diagnose a subject’s projections of ideas upon a given, more or less 
structured material, while the analysis of gestures or expressive move- 
ments deals with a direct motor expression which is neither projected 
upon structured material nor transformed into any content. 

While many non-graphologists explain graphic gestures by learning 
and by chance, Mendel starts with the assumption that graphic move- 
ments are patterned by personality. "Every gesture may fuse three 
currents in man: the emotional, the intellectual, and the physical. 
Likewise every stroke of his pen." However, with the admission, " 
have speculated freely, drawing tones and shades from the slightest 
clues," Mendel plays into the hands of those who condemn graphologY 
and does not do enough justice to himself. He relies in many ways on 
previously tested relationships between certain graphic movements an 
certain personality trends and between certain movement configura- 
tions and personality patterns. The lack of experimental data, O 
methodology, and of a bibliography is partly offset by detailed tables 
of interpretation and by the use of graphology in diagnosing normal an 

abnormal aspects of personality, supported by empirical observation. 


-a NO 


had 
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Especially interesting is the chapter done on "Physiology and Path- 
ology in Handwriting" by Alfred Kanfer, who validated his grapho- 
logical observations of patients by hospital diagnoses. 
' Mendel follows the statement of Gardner Murphy, which he quotes 
in the beginning of his book, namely, "not to maintain that the interpre- 
tation of character by handwriting is a science; its most devoted ad- 
herents would make no such claims." As far as graphological diagnoses 
can be validated by empirical experience, as far as successful predictions 
can be made far above chance expectancy, as far as graphic movements 
can be measured in their pattern and evaluated in their expressiveness, 
as far as the methodology is consistent, graphology has to demand a 
scientific recognition. In the opinion of the present reviewer, the direct 
analysis of expressive movement without intermediary material for 
projection potentially a e precise and more defined applica- 
tion of scientific criteria than projective methods. Mendel's illustra- 
tions and interpretations en i dictionary about the language of 
personality but they do not reveal the laws of its grammatical structure, 
those laws which govern graphic movements as one of the many 
expressions of personality. When Mendel states, “J hope the experi- 
e them, then reject 


mental psychologist will take my hunches, examin 
or verify and refine them,” the present reviewer is one who supports 


this appeal 
id WERNER WOLFF. 


Bard College. 


GrsELL, ARNOLD, AND AMATRUDA, CATHERINE S. Developmental diag- 


nosis. (2nd Ed.) New York: Hoeber, 1947. Pp. xvi--486. $7.50. 


The revisions of, and additions to, the first edition of this book 
hardly justify publication as à new edition. Eleven of the 19 chapters 
of the first book remain completely unchanged. One chapter, called 
"Developmental Pediatrics," has been added to the new book. The 


first edition chapter entitled “Special Sensory Handicaps,” has been 
divided into E oci on " : “Blindness” for the new book, 


and illustrative cases have he only other changes in ne 
chapters entail the addition of a section on aphasia, ee bod 
Paragraph in one chapter; and the addition of two paragraphs dm 
new chapter. Other changes in the book of any consequence ; 
addition of material on dlectro-encephalography, in diagnosis o convu a 
sive disorders, a brief section on visual cerebral injury; and up re 
à small amount of material from another recent book of the E n 
author, “The Embryology of Behavior.” Additions to the appen ices 
include a new list of Dr. Gesell's films, 2 very brief list of child guidance 


books : i Dr. Gesell's name as an author), 
th t include Dr- / 
(three of which do no eol training for developmental pedi- 


and a short appendix on pro 
atrics, 
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As in the first edition, the material is presented in a chatty, informal 
manner with only rare references to the experimental or clinical litera- 
ture. Many vague, unscientific concepts expounded in the first edition 
persist unchanged in the second edition. The book no doubt has some 
value for the practitioner, but it offers little to the psychologist or child 
development specialist. 

MARGARET R. KUENNE. 
University of Wisconsin. 


TYLER, LEONA E. The psychology of human differences. New York: D. 
Appleton-Century Co. Inc., 1947. Pp. xiii 4-420. 


The psychologist who would write a text on differential psychology 
encounters a number of problems. The appraisal of the text can be based 
in part upon the way in which these problems are faced. ‘The author 
must decide: (a) whether to strive for a broad sampling of topics or for 
thorough discussion of particular problems; (b) how to relate the expert- 
mental findings to the controversies and the diverse points of view 1n 
the field; (c) how to approach the statistical problems which underlie 
any treatment of differences between individuals. 

In her book, Dr. Tyler has resolved the first problem in favor of a 
broad coverage of a variety of topics. She undertakes to discuss both 
group differences and individual differences. She presents chapters 
both on the feeble-minded and on factor analysis. She treats both the 
heredity-environment issue and the use of test batteries for the predic- 
tion of vocational success. Since the total book is only 400 pages, the 
treatment of many topics is suggestive rather than exhaustive. At the 
same time, however, the book provides a good introduction to a wide 
range of problems relating to individual differences between persons 

In her presentation, Dr. Tyler tries to distinguish between the facts 
which have been gathered for a particular area and the interpreta- 
tions which are to be made of them. Her point of view, to which the 
reviewer would subscribe, is that psychologists differ very much more 
with respect to what the facts of individual differences mean than with 
respect to what the facts are. Each chapter undertakes to present 2 
selection of the most significant facts in a field, and then to discuss the 
problems of interpreting those facts. The evidence selected for presenta- 
tion seems generally well chosen, and the interpretation of the facts 
seems to present a mature point of view which avoids doctrinaire pos!” 
tions on either extreme. Maybe this merely means that the point of view 
is in most cases one with which the reviewer finds himself in agreement: 

What to do about statistics is always a difficult problem for this 
type of a book. One has the choices of (a) writing a book for which no 
statistical competence is needed, (b) assuming that each reader alrea Y 
has competence in statistics, or (c) trying to include instruction ?? 
statistics in the text on individual differences. The first choice nece? 


—M 8 lÁÀ 
YIN 
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hich is superficial to the point of triviality 
unrealistic for many groups and institus 
he book. The third choice puts a distres- 
on upon the author. A chapter of 


sarily results in a book w! 
The second choice is probably 
tions which would want to use t 


wd responsibility for a dual functi 
statistical instruction often seems not to fit well into the book as a whole, 


I: Idi same time provides a very unsatisfying course in statistics. 
dba er has adopted a variant of the third technique, but the statis- 
Kid aterial is quite limited in amount and makes no pretension of 
ms ping competent computers. It is slipped in rather unobtrusively 
b various points in the book where it 1s needed, and would appear to 

e fairly painless. However, one may question whether it provides 
sufficient background for the student to read with very great under- 
standing the sections of the book dealing with use of tests for prediction 
and with the effort to develop pure tests of primary abilities. Perhaps 


no better solution is possible for this very troublesome problem. 

In general, Dr. Tyler has written a clear, readable, succinct book, 
presenting an overview of many of the problems of individual differ- 
ences and a sound appraisal of the difficulties both in conducting the 
research and in interpreting the results. The text should be quite useable 
both for advanced undergraduate students and for graduate students 
whose field of specialization lies outside the limits of differential psy- 


chology. 

ROBERT L. THORNDIKE. 
Teachers College, Columbia University. 

ALscuuLER, Rose H., AND HATTWICK, LA BERTA W. Painting and 
personality: A study of young children. Chicago: University of Chi- 
cago Press, 1947. 2 vols. Pp. xi 4-590. $10.00. 
Painting and personality is dmark in the study of 


personality and in the study 0 c It is equally note- 
worthy in the annals of publishing and printing techniques: seldom has a 


publisher contributed so greatly to the presentation of research as has 
University of Chicago Press in the 120 magnificent full-color plates by a 


new process. 


an important lan 
f early childhood. 


was far from common- 


Beginning in 1937, when su com! 
place, Alschuler and Hattwick undertook to study through individual 
Case studies and quantitative group comparisons the dynamic relation- 
ships between inner personality tendencies and spontaneous creative be- 
havior in 2-5-year-olds. With regard to the general expressive hypothe- 
Sis, Alschuler and Hattwick confirmed the expectation of individuality 
A each child's “characteristic and unique abstract use of color, line, 
form, and space" (p. 3). But beyond this their data revealed to them 
,& very general, and perhaps even universal, tendency for all sorts of 
individuals to express similar feelings, reactions, and problems in like, 


9r at least comparable, fashion" (p. 4)- 


ch an approach 
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These findings are based on observations and daily — a 
children over one or two years from various social settings. r n t Aa 
vidual analytic procedure began with the finished paintings. Each ble 
analyzed in terms of use of mass, line, form, space, color, peque 
content, verbal comments. Then, behavior and personality dar on e 
same day were examined and an attempt made to note relations ap 
lack of them. Appearance of consistent trends in various children es 
duced hypotheses for later quantitative study. The forms for brea E 
down the daily qualitative records of behavior in creative and free p id 
situations and the charts and summaries for compiling the bow : 
data over extended periods may well be useful to other workers P ir 
field (pp. 224—259). One case study is given in extended summary pi 

Group analysis was carried out by determining which children E 
ferred each research medium; and within the easel painting some, 
which children showed each of some fifty-odd characteristics (e.g dei 
phasis on Red, Emphasis on Cold Colors; Intermingling of C ee fugi 
vs. Color Emphasis; Diagonal Strokes; Dotting; Painting Off t P ls ae 
etc.). Children showing a certain characteristic were compared wi ss in 
group as a whole or with children showing a contrasting aan 
their paintings on a series of 113 personality characteristics unde rie 
general headings (pp. 235-237). Tables and interpretive summ? ore 
present those relationships (pp. 381-582). “Only when the chances ee 
85 or more in 100 that the observed tendencies would be found in @ es did 

` sampling, was a given relationship included in a table” (p. 381—3t216 
original). — 
It seems to this reviewer, however, that still greater caution 15 Hes 5 
sary in statistical terms. Admitting relationships at a level ° ovet 
chances in 100” means, of course, reliance on critical ratios barely f di 
1.0. Moreover, some of these relationships apply to a minorily © 


ac* 
children in a group. For example, among the traits found to be char? 
teristic of children emphasizing red in their palettes is one labeled -able 
pendent on adults for affection.” Examination of the appropriate ing 
(p. 387) shows this trait was found in 3997 of the 46 children shonas 
emphasis on red. Among the first 300 of several thousand critical rê 
presented in the tables the reviewer counted only 4 of 3.0 or more. ya 


; F $ u 
The major findings of the study, based apparently on both indivi" he 
and group quantitative analyses appear in the first 165 pages e not 


work. The major conclusions (one would surmise: the structural plat ey 
the book is not always clear) appear to be those stated on page 14. gei 
are in fundamental agreement with Rorschach's analysis of percet 
organization and are worth quoting in full: of i” 
a) Color tends to give the clearest clues as to the nature and degree 
tensity of the child’s emotional life. i amount 
b) Line and form tend to give the most intelligible clues to (1) t e Tising 
of energy the child is expending, (2) the degree of control the child is € 
(3) the direction in which that control is operating. 
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Color and line and form considered together are likely to indicate the bal- 
ance which exists between the child's impulsive drives, on the one hand, and his 
overt, controlled behavior, on the other hand. 

_ €) Space usage and spatial pattern tend to g 
inner life than a picture of the child as he relates, 
ment. 


ive less a picture of the child’s 
and is reacting, to his environ- 


ors follows in rich detail, supported by 
tables already described and—rather 
f the behavior of children showing 


The elaboration of these fact 
references to the quantitative 


more impressively—by descriptions o. 
markedly the painting-characteristics in question. These materials 


will provide the starting point for new research in this field for years to 
come, and will undoubtedly be of extreme value in the clinical interpre- 
tation of children's painting. But most disturbing here and throughout 
the study is a kind of ambivalence. On the one hand, e.g., the personal- 
ity characteristics of children preferring red, blue, green, yellow etc., 


respectively, are outlined. On the other hand, the authors find it neces- 
n dangerous generalizations, not in- 


sary to emphasize that these are us § 15, 
tended for sweeping application. Each painting characteristic has a 
Polential significance but it is ‘“‘more often in the interrelationships of 
several aspects than in any single characteristic that the distinctive and 
telling qualities of the child's products are likely to be.” But, if this is 
so, one wonders why so relatively enormous à preponderance ems x 
search itself and of its publication are devoted to the thin gruet o the 

gle painting traits 


admittedly inappropriate study of the relation of sin 
to single personality traits, particularly as the admirably complete pres- 


entation of the results shows them to have such disappointingly low 
Statistical significance. Nor is there sufficient guidance to those who 
would follow in how to weigh the "interrelations of trends —for it is 
seductively easy to take separate factors and characterize he n 
exhibits them, even though one is warned against doing SO. Dis am i 
alence is not simply that of the authors but, to this d E vo s 
rather clearly the state of psychological Lap curis 1 pon 
general. We are torn between the obvious deman po d a 
structure itself that it be viewed holistically, and att ái pa Ee 
Of the research techniques available force us to an erenn an 
ism. Alschuler and Hattwick have done yeoman service i^ wo 
development of each horn of this t but ee pa : ED 
© get fr o the other. e pervasi work, 
nd kae pe ft im, HES in the assumption that each aspect id -— 
Be vi be “of necessity, studied steers Lyn e 0er abus, z a 
Which th . ting might be relate T A . 

FEVlewer's italics). or major step in personality research will 


come a. š s that necessity- 
Poan din LAWRENCE JOSEPH STONE. 


Vassar College. 
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z ;nical 
PENNINGTON, L. A., & Bere, I. A. (Eps.). An Introduction to clinica 
psychology. New York: Ronald Press, 1948. Pp. xv+595. 


. . " " itors 
This is a multi-authored volume which is unique in that re tye 
have not written any of the chapters, and indeed, it is diffic t Sor t is 
tain what role it was they performed in producing this boo i b 6s 
this reviewer's impression that we have here à decidely rm ien 
with a modicum of editing and chapters which yary d M e caly 
Although the editors state that the book is “to describe s eed 18 
at the introductory level what clinical psychologists do, A modd 
know what is meant by the introductory level. If it is the s st zo in 
graduate study, much of the book is repetitious of materials PE a 
competent undergraduate courses. Is the introductory es H is. dee 
other hand, represented by an undergraduate course, pl cate 
liberately organized around a book such as this? Perhaps p dandi 
tribution of this text may be in smaller institutions faced iss meh 
lem of teaching an undergraduate course in "clinical Www Prob- 
The book is divided into five parts—Introduction ; C ome 
lems; Clinical Methods; Psychotherapy; and Professional Re e the fact 
While the editors should be given credit for an awareness o ductory 
that an orientation to psychopathology is necessary for the introc dow! 
student, there is a logical, rather than a psychological, ten, nel 
among these various parts. Thus, the coztent of abnormal paye logy 
(Part II) is presented as distinct from the methods of clinical psy s somes 
(Parts III and IV). The section on methods, for example, S 
largely a matter of straightforward description of techniques W! oW 
much discussion of the way these techniques operate except m era 
case histories. A review of the various chapters may give a clearer € 
cept of the pattern of the book, and point up its unevenness. 


. inical 
Part I, “Introduction,” opens with Chapter 1 on the "Meaning nem 
Psychology" (R. B. Cattell). While the training situation is treated "clinica 
and forcefully, some may take issue with the author's statement that natura 
training applicants are those who cannot face the "discipline of the de 
Sciences." There are others, too, who would doubt that there is more fu r intui- 
clinical psychology through factor analysis than through more dynamic 9 
tive approaches. 1 vior?" 
Chapter 2 is a discussion by O. H. Mowrer on “What Is Normal Be M istical 
Here it seems that the editors could have eliminated the discussion of hich is 
normality, psychographs, invalidity of pencil-and-paper tests, ett gym 
probably repetitious for the student. The chapter is cleverly di ity, P i 
posium of specialists in various areas concerning the problem of Hd i give 
the attempt to interject the viewpoints of the different specia on 
kaleidoscopic feeling which perhaps proper editing might have ge "singer o 
Part II on "Clinical Problems” opens with Chapter 3 by D. M redis as a 
“Signs of Personality Disintegration.” His stress on viewing 5» "trative eem 
dynamic whole is a precaution well stated to the beginner. Thei peut i^ 
material, half of the chapter, is well presented. However, brat discussio? 
tant point of view which the author apparently overlooked in 
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This is the recognition of the clinician's own 


the clinician's responsibilities. 
s means of handling them—a point to be 


problems, and his awareness of hi 


Stressed early in training. 
Chapter 4, concerned with "Behavior Problems of Children" is written by 
z L. and H. H. Anderson. What is particularly one-sided is the emphasis upon 
ominative behavior of parents as à prime source of maladjustment, and one 
wonders why there is utter neglect of the dynamics of rejection, sibling rivalry, 
tion is treated from the standpoint of 


rer protection and submission. Domina 
he objective effects upon the child and his overt resistances to it, but no men- 


tion is made of the psychodynamics involved. (Neither Symonds nor Levy is 
mentioned in the bibliography.) It would seem that the editors, as clinicians 
themselves, should have been impressed with the one-sidedness of this chapter. 

Mandel Sherman, in Chapter 5 ("The Mental Defective"), confines himself 
to a discussion of syndromes encountered in mental deficiency—cretinism, mon- 


golism, etc. A paragraph of some twenty lines devoted to environmental de- 
does violence to this very important 


ficiencies, in the opinion of this reviewer, 
topic by nature of its very brevity- There is also no treatment of the personality 
structure of the defective, especially as delineated by Kounin. 
Chapter 6, “The Physically Handicapped,” by S. Marzolf, discusses the 
Sensory, motor, and other disabilities (chorea, encephalitis, etc.), from the 
While very good from these 


standpoint of etiology, description, and diagnosis. i : I 
scussion on the meaning of physical handicap 


Standpoints, there is ver little di 1 
to the patient, although relerenté is made to the recent work of Barker, Wright, 
and Gonick in the bibliography. Again, it is felt that stress should be not so 
much upon static symptoms and syndromes but upon dynamics of the situation. 
R. M. Bear's Chapter 7 on “The Educationally Backward is a generally 
adequate treatment of disabilities in the school subjects. A minor error occurs 
on page 143 where the Binet is listed as a test which does not place a premium 
9n verbal capacity, an error which could have hardly escaped the editorial eye. 
Chapter 8 on “Speech and Personality," by F. H. Sanford, is somewhat ofa 
condensation of his 1942 article in this journal. There is thus a difference in 
Style and approach between this chapter and the others, concerning which the 
editors are aware. There is much more of a research orientation (as witnessed 
Y 74 references, more than any other chapter, and by the content). A debata- 
le statement occurs at one point (P- 174), where it is stated that there cdd 
cen no major changes in the theory of aphasis and of allied diouer sinen , 
days of Head," with reference to the latter's 1926 publication. a e m o 
urt Goldstein and his associates in this area 1s neither wet nor e : 
A. C. Washburn's treatment of "Tension States (Chapter 9) has Late ar 
Yalue in orienting the student to the stresses which bring about these conditions. 
he discussion of even the physiological and neurological aspects Is geared wid 
level thoroughly comprehensible to the beginning student. However, one —€— 
SÉ to have seen more case material drawn e aue psychosomatic itera- 
ure, since th is referred to in the bibliography- d 
doe doer d dac is the title of Chapter 10 by J. Ball. Basically 
= is a good succinct presentation of the analytical — o ee 
evelopment as a background. One point is moot, t ough, namel 
Ball states th page iod has little significance for later sexual prob- 
l I at the latency peri? a i ifficulties in ado- 
bin However, it is the repression? of latent e ee relations 
Scence. Her conclusions on unhappiness as related P clinic cages 
may be too heavily weighted with results from studies o! € ini 1 


VIEW. 
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Chapter 11 by R. G. Kuhlen covers “Psychological Trends and mon f 
Later Maturity." It covers the field well, emphasizing the increase pe wig 
of the aged, change in abilities and adjustments, and practical Reime eod 
roblem. Unlike other chapters, however, there is no illustrative mate . 

utei on the senile personality, and thus there is editorial Inga 4 she 

Chapter 12 by H. Cleckley on “Antisocial Personalities is per s fee 
weakest chapter of the book. There are three principal Lap e w S 
linquent, the criminal, and the psychopath. The section on bhes : e title 
merely a series of brief descriptions of delinquent personalities, wit h OE 
on the causal factors in delinquency. Although the work of Healy ane ety 
and of the Gluecks, e.g., is noted in the bibliography, the author giv o ing. 
couht at all of the findings of these writers in this important field. ie om 
raphy does not cite any sociological work by people like Shaw and E ratte 

R. M. Dorcus's treatment of '"The Psychoses and Psychoneuroses ERE 
tutes Chapter 13, and is essentially an overview of any abnormal psy ed : J 
text. Case histories, especially for the beginner, are particularly pod D | 
editing. One wonders what the following medical terminology, sd bridge * 4 
the student without medical orientation (p. 280): “epithelioma athe i 
the nose and senile keratosis below the outer canthus of the le P eoassioi 
domen distended and tender but not fluid percussed. Emphasis S aliases 
is uneven—27 pages devoted to organic psychoses; 13 to functiona bs upon 
and only 8 to psychoneuroses. Although Dorcus is aware of the emp 313 that | 
the organic, it is certainly at variance with his own statement, on p. ? al psy-~ 
“psychoneurotic disorders represent the major area in which the clinic? 
chologist is most likely to function." 4G. E- 

Part III on “Clinical Methods” consists of four chapters. Chapters 1 i 16 
Horrocks, “The Measurement of Achievement and Aptitude") and CHER 
(W. T. Donahue, "Interest and Personality Tests") are descriptions of ur reli 
commonly used tests of achievement, aptitude, interest, and personality» e) to 
of the paper-and-pencil type with discussion of concomitant problems re " in 
testing and some illustrative cases in which tests are used. A. Magaret ven 5 
telligence Testing, Clinical Practice" (Chapter 15) has an excellent diets 
the uses and limitations of the individual tests of intelligence 1n the à vi fie 
situation. For the beginning student the presentations of the difficultie oe 
herent in current "scatter analysis," especially with the Wechsler-Bellev ue, oat 
of observation of client behavior in the test situation are particularly a ib 
mendable. The discussion of the Shipley-Hartford CQ Scale, however, fai tive 
indicate its demonstrated invalidity. H. Sargent’s Chapter 17 on “Projec m 
Methods" is effective in introducing the student in lucid style to the uem ae ee 
projection and to the tests themselves. The discussion on the theoretical Je th- 
ground of projection and the sections on cautions in use of the projective or 
ods, especially her emphasis on the variable of the clinician himself, are P* 
ticularly effective. E upi- 

Part IV, “Psychotherapy,” opens with Chapter 18 by F. McKinney 0n espe" 
rective Techniques." It is essentially a review of the directive processe? "y: 

ially as used in the college counseling situation. There may be some mislead! © 

cially l 1 y y aP 
connotations, though, in that techniques common to practically all therap 458, 
pear to be subsumed under the directive heading. For example, on. Pr prie 
under “Forms of Directive Therapy" appears “Expressive Therapy t Mu i 
discussion of catharsis; on p. 450, under "Directive Therapy The 0 E 
Atmosphere” is a short description of Rogers’ permissivesness' Group pu 
tmo P -- "Ie mentioned as a directive method, which it may or may 
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“Client- "i b 
satione Cantar Ties ea earns d D fing. The 
y ern [ i ginning s udent may find bewildering. The 
g ive approach is termed negativistic. Again, on p. 470, he states that 
afe o theoretical contribution to non-directive therapy which definitely 
sit ih d i asi is Rogers' belief that the use. of psychometric tests can 
Dep with the process of therapy" and only six pages later (p. 476), in 
piene i a case, Snyder states, “at this interview the results of the recent 
oo and personality tests were discussed with Mr. D. ..." This is fol- 
aan on p. 477 wi th a whole page of test results! Of particular value, however, 
S nyder's frank discussion of the limited value of the non-directive method 
with certain types of clients. 
f Maladjustment" by W. Johnson, while pre- 
f this problem, suffers by contrast with 
the two preceding chapter: ust how the semantic therapist 
operates. The specific techniques o ticist are couched in general 
terms, apart from mention of procedu on to all therapies. There is 
complete absence of illustrative case material, a point upon which perhaps the 
editors should have insiste 
comprehensible to the student. 
“Recent Advances in the Treatment of Mental Diseases" is the title of 
tment, prefrontal lobotomy 


Chapter 21 by G. K. Yacorzynski. Shock trea 
(with a good discussion, of considerable interest, attempting to synthesize both 


from the standpoint of physiological theory), narcosynthesis, hypnotism, psy- 
chosomatic medicine, and experimental neurosis are covered briefly. It is sur- 
prising to read (on p. 542) the following: “One disadvantage of a psychologist 
Serving as a psychotherapist is that he may lose sight of his primary objective, 

ding of mental illness. . . . 


that of contributing to our knowledge and understanding 1 
f therapy in no way indicates that the therapy 


The practical beneficial results o 

taps one of the major processes of human adjustment. Theories of human be- 

havior built on such evidence are to be accepted with great caution." However, 

the psychologist who acts as @ psychotherapist may make some penetrating 

observations on behavior and “hue contribute to understanding, i.e., research. 

Orientation towards the practical aims of therapy, for example, provided us with 
i hoanalysis. One may 


a very influential theory and research stimulant, viz., PSYC i 
also question whether any benign therapy takes place without tapping à funda- 


mental adjustive process. 
th Part V on *'Professiona 
at concerning "Psychiatry an 


Psychiatrist. It considers the ro i 
on is emt 


d in order to ma 


1 Relationships" constitutes but one chapter (22), 
d Clinical Psychology," by D. G. Wright, a 


le of the psychologist, cooperating with the 
Psychiatrist. Wright’s discussi nently fair in its recognition of. the 
art of the book, though, one would have liked 


ets duties. In this P: : ough, , € 
fo have seen a discussion of other professional relationships of the psychologist 

in relation to the social worker, the educator and the industrialist, etc- 
b ve a book in which style of writing is varia- 
le, which in many places may be repetitious of work previously 
which the treatment of the 


bs H 
Overed by the student in other courses, In r t 
from the very good to the mediocre, 1n which 


In summary, then, we ha 


eni 
en matter ranges 
Pertinent illustrative case ma erial may be present or opm ee above 
» In which tf ” iting appears to be somew at lacking. 
the over-all editing apP RaLPH D. NORMAN. 


Uni: n 
niversity of Michigan. 
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INTRODUCTION 
alization and its presumed 


ml eee The doctrine of CR gener 
rophysiological mechanism of irradiation and concentration entered 


m pa of behavior or, in his own words, "highest nervous 
lasiin Y, rather late. Extinction, the four kinds of internal inhibition 
(latar pid differential, delayed, and conditioned), external inhibition 
exist : — inhibition of inhibition, and even analyzers, differ- 
eun ists rais by contrasts, conditioning of higher order, and com- 
eve ege itioning—all preceded it as concepts or phenomena. How- 
missed en irradiation-concentration was promulgated (34, 1910), it 
Sasi: Ae paramount position. Forming the core of all cortical dy- 
, it became the "basic law of the highest? nervous activity" (34, 
concepts 2 


"me subsuming all older nd adding new ones, as sub- 
ead to account for alleged new CR discoveries. As is well known, 
non Ov sentire system is conceptualized primarily upon terms borrowed, 
i geni from classical neurophysiology: But one may also discern 

is constructs the influence of theories of hydraulics and of sound 


tr; "e ^ 
ansmission, the psychophysiology of G. H. Lewes, the logic and 
publication while the writer wasa Fellow of the John 
ndation. 

eferences 
himself, however, 


way from those pu 


1 " 
ls ct was prepared for 
* For ggenheim Memorial Fou 
of bue mere the writer's T 
is d Bekhterev. The writer 
the Russis: ions may thus differ in some 1 
and qui n word VYSSHEY—@ word that occurs also in the subti 
ite commonly in the text—is translated by Gantt as higher. 


highest ; 
est in abstracting the book later translated by Gantt, and then fou 


tra 
nslators used hachste. 
337 


are to the English translations of the works 
uses the Russian originals, and 


plished. In this particular case, 
book 
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mental chemistry of J. S. Mill, and the stream of consciousness of 
William James.? 

Pavlov's empirical findings of CR generalization in dogs were 
supposedly confirmed by Bekhterev, the neurologist and psychiatrist, 
in human adults (4), and by Krasnogorski, the pediatrician, in children 
(26). And both Bekhterev and Krasnogorski fully accepted Pavlov S 
neurophysiological interpretations (op. cit.). However, Beritofl, the 
Georgian (U.S.S.R.) physiologist, and Konorski, the Polish biologist, 
were highly critical of Pavlov's cortical constructs (5, 25), and it 1s 
known to the writer that a number of Russian psychologists were 
dubious even about some of his findings. Still, Pavlov's prestige was 
very high, and opposition to his views—except by Beritoff—was very 
much subvocal in nature. Criticisms came later, in the Thirties, but 
then they were not on experimental grounds. The teachings of Pavlov 
and Bekhterev were declared to be not accordant with Marxism: 
materialistic, yet asocio-political and insufficiently dialectical. Th 
criticisms were directed mostly against the Human Reflexology ° 
Bekhterev—even though Bekhterev himself wrote a long monograph (3) 
attempting to prove that Reflexology is Marxian in essence. Pavlov $ 
laboratory, experimenting with animals, was given greater subsidies: 
Marxists, like Cartesians, draw sharp dichotomies between human 
beings and animals. . 

In this country, in the early days of Behaviorism, CR generalization 
was taken as a qualitative fact, and even demonstrated by Watson ^ 
who called it transfer—in his emotional conditioning of Albert (59)- 
But when Pavlov's books came out in English (33, 1927; 34, 1928), 5 
number of American students of behavior became critical of Haee 
theories and skeptical of a good many of his facts. Lashley, himsel | 
pioneer in CR experimentation, led in general criticism (28); Louc p 
& young and able CR experimenter, took to task the doctrine of í 
radiation and indirectly that of generalization (30, 31); and Guthrie: = 
old-time Behaviorist, offered a new theory of generalization and of e 
ditioning as such (11, 12, 13). On the other hand, Hull, quite affec et 
by the quantitative and systematic impliations of Pavlov's books, ê 


up a special experiment (with M. J. Bass) (2) to test Loucks' scathing 


iter 

? From a few long conversations with Pavlov in the summer of 1934, the ue 
gained the impression that Pavlov was quite conversant with the writings of V fr 
James, and the British Associationists (Pavlov studied at, but did not gradua ike 
the Ryazan Theological Seminary). However, despite his gallant tribute to + active 
(spelled Thorndyke in Anrep's translation— 33, p. 6) as his predecessor in the Kcahler in 
analysis of behavior and his discussion of the views of Guthrie, Lashley, and Kö wi 
one of his articles (35), it is the writer's opinion that Pavlov had little familiaritY 
modern American comparative and systematic psychology. 
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bar ir gt i and generalization. , Hull was apparently 
A sults redeemed Pavlov's findings and accorded CR 
ari systematic status as postulate 5 in his system (20). Gen- 
i Sie e in general, became fashionable at Yale, in the mid-Thirties. 
[veas started a series of experiments in the field (16, 17, 18); and 
m y icem upon a scheme of “deducing” discriminative learning 
i R generalization (55, 56, 57). Hilgard and Marquis allotted an 
Spp oving half-chapter to generalization in their book (15), and other 
extbooks followed in accepting it as an established fact, presumably an 
objective and quantitative substitute for the old “Law of Similarity” 
advocated by J. S. Mill and by Spencer but disputed by most other 
Associationists (8, 54, 58). 

: The Lashley-IIull controversy. The entire problem of CR generaliza- 
tion sprang up, however, again two years ago with a very stimulating 
article by Lashley and Wade (29). Lashley and Wade challenged every- 


thing in the doctrine: its facts, its interpretations, its significance. They 
for its specific criticisms, but 


Cited Loucks' analysis, apparently not only 
usions drawn by Pavlov and 


also for its general implications that concl 
his students ought not to be taken at their face value. They stated that 


Bass and Hull (2) and Hovland (16, 17, 18) did not really corroborate 
primary stimulus generalization inasmuch as they used human subjects 


"for whom the stimulus series represented familiar relational sequences" 
thinking" (p. 75). They 


and who may have used "habits of relational 
“failed to demonstrate a 


mentioned the study by Wickens (60) who 


Stadient of stimulus generalization’ and one by the writer (48) who 
ent types of stimuli too variable to 


found "generalization with differ 

formulate under any simple laws." Finally, Lashley and Wade detailed 
à series of their own experiments in all of which they failed to find 
Seneralization in animals or human subjects by their own technique of 
n to a single stimulus then 


training a group of subjects in a reaction 
pen her on the same stimulus dimension and 
habit when the 


9Pposing that stimulus to anot same stim 
Comparing the rates of formation of a discriminative hi when 
Teaction to the initial stimulus is reinforced and when it 1s extinguished 
Y the differential training.” 
Lashley and Wade's interpretations are eve -reac 
their criticisms of the experimental findings of CR generalization. 
terpretations may be best summarized in their own words. They are: 


p. jg te is no "irradiation" or spread | 

2. The pl NT lization” represent a failure of asso- 

iati “stimulus generalizatio? at S 

“lation (p, Uri oim hen the distinguishing characteristics of mk 

iis arte associate them with the conditioned reaction (p. E . +» [and this 
"eralization is thus really a] generalization by "default" (p. 82). 


n more far-reaching than 
These 


of effects during primary conditioning 
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3. The “gradient of habit strength” is a product of variable stimulus thresh- 
olds, not a spread of associative process (p. 74). .. . The gradient varies with 
degree of attention and is unrelated to habit strength. ... When inattention 
or threshold values are not involved, no evidence of a gradient is found. ..* 
Consequently, a test for “irradiation” may give the appearance of a gradient of 
habit strength when it is actually measuring discriminative thresholds under 
distraction (p. 84). 

4, It [true generalization] does not occur in conditioning to a single stimulus 
but is somehow a function of differential training with two or more stimuli on 
the same dimension (p. 82).... The "dimension" of a stimulus series . - : do 
not exist for the organism until established by differential training (D. 74). 
. +. The dimension itself is created by or is a function of the organism and only 
secondarily, if at all, a property of the physically definable character of the 
stimuli (p. 82). ... In the early stages of Pavlovian conditioning the only 

dimension” common to such [tested for conditioning] stimuli is that all pro- 
duce a sudden change in the environment. ... With continued training the 
subject ... may or may not show narrowing of the effective range on a stimulus 
dimension. Apparently such changes are a matter of chance noting of differ- 


(p. 81) generally with little regularity [reference to a study by the writer (48)] 
p. is * 


Hull replied to Lashley and Wade (21). His reply is much more a 
defense of the experimental findings of CR generalization than 4 dis- 
cussion of Lashley and Wade's special interpretations. He cites & 
private communication by Wickens that his [Wickens] data were in 
some respects quite harmonious with Hovland's findings and only when 
the Chi Square was calculated did “none of these differences [the 
rac raed between the conditioned and generalization stimuli] reach 
ee a E cent level” (p. 130). He further states that in some 9, 

ovlan s experiments "the gradient at the first trial was horizontal 
E that “If the gradient eventually found were due to the indirec 

aia previously formed habits, e.g., of speech, it should have aP; 

RUIN pride d E trial” (p. 128). Hull then divulges unpublishe 

Lashley nid Wed. s porn a nnn generalization gradient "m 
meom d ade technique, and reproduces seven graphs of CR 8* 
zations from the studies of Anrep (1), Bass and Hull (2), HovlaP 

(16, 17), Brown (7), and Wickens (60): Hull's own views need not be 


^ Hull states: n gon? 


; “The reference given b been £^ 
over with great care, but the eas ee = ppm pis ipe pt anything 
which would cast doubt on the reality of the falling gradient of stimulus generalizat a 
tm The article is essentially a contribution to a controversy concerned with response v 
explicit patterns of stimulations, a matter which is not under consideration here" ah n 
130). Hull is obviously discussing another study by the writer (43) which is gi, 
Lashley and Wade's bibliography, and not the particular one (48) to which they ed 
in the text. There isa typographical error in Lashley and Wade's reference: the write 


" 
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Dac a a pet pec from is writings on this topic 
de Wade : x ilgard (14, ) seems now to lean more to Lashley 
ad a ews than to those of Pavlov and Hull in stating that "gen- 
E s may be along discriminated dimension" (p. 338). But a 
ae Pas i line generalization curve with a modified Lashley and 
2 EH ec inique was found by Grandine and Harlow (9) who, however, 
ate that their findings “give no definitive answer as to whether this 
phenomenon of stimulus CR generalization is the result of some pre- 
established gradient around the training point, or is the specific result- 
ant of the habit or habits established” (p. 336). 
Background of the writer's interest. The writer has been interested in 
CR generalization since he published (with C. J. Warden) his first 
Survey of pertinent Russian literature in 1929 (52). Through that 
Survey and a number of others, the writer became not only critical 
of the theories of Pavlov and Bekhterev (39-42) but also convinced 
that the very empirical findings of the Russian laboratories need con- 
Siderable checking and analysis. He came to learn that while the 


laboratories of Pavlov and Bekhterev are well advanced with respect 
ministration, controls of 


to apparatus, physical set up of stimuli adr 
Secondary cues, and refined measurements, their workers are very naive 
With regard to experimental design, sampling errors, and statistical 
treatments in general. Furthermore, it became known to him that in- 


dividual CR experimenters in Pavlov's laboratory seldom exercise in- 
Consequently, 


dependence in the interpretation of their own data. 
ally some significant 


When the writer wished 10 years ago to review critic 
aspect of conditioning, he chose extinction (45) rather than generaliza- 


tion, believing a good deal of the alleged phenomena of the latter to be 
little established. In 1937-1940 he carried out a series of experiments on 
the generalization of salivary CR's in adult human subjects. Three of 
the experiments have been published (44, 47, 48), but the publication 
9f the rest has been delayed (three are in press now). In all three pub- 
lished studies, some CR generalization was manifested, but its course 


and very existence varied so much with the type of stimulus used that 
1 findings cast grave doubt 


the writer stated in 1940 that his “empirica ES ee : 
Upon attempts to ‘deduce’ transposition in discrimination experiments 
from generalization” and that laws of transfer ‘‘must be patiently dis- 
Covered through experimentation and study of results at each level or 
orm of stimulus, response, and organism organization (48, p. 11). 


” rather than "(18)." This study 


Mentioned study should have been referred to as “(19) 
n Hull's article. 


does : 
not appear among the references i 
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id : i 
Purpose of the present review. While the alleged —€— d 
irradiation—the temporal after-effects of the application of co eie 
stimuli—is, in the writer's opinion, worth checking and cannot we 
sidered settled by Loucks’ analysis,’ and while response genera a 
offers interesting possibilities, —there is no doubt that the impor ^: 
problem for students of behavior is that of stimulus generalization O 
the more or less permanent CR influence of a conditioned stimulus gt 
a non-conditioned stimulus that is in some way related to it. , ily 
this is the problem which the present article purports to treat as tu 


i > p ¿amine all 
as feasible. Specifically, the article will attempt, first, to n 
available evidence in answer to four empirical questions (three prim 
and one secondary) on C 


Mir" eu me 
R generalization, and, then, to provide so 
integrative interpretations of the evidence and the answers. 
The three primary empirical questions are: 


H . . HH some 
1. Is there consistent evidence to show that an organism conditioned to 
stimulus will also pro 


duce a CR in some degree to some other related stimulus, 
with which the organism has had no previous pertinent experience? —— of the 
2. If the answer to the first question is positive, What is the relation e to 
strength of the CR to the related stimulus—magnitude, latency, resistanc 


The secondary empirical question is: 
What is the relation between the ar 
and the answers to the firs 
Pirical question, the gen 
given consideration here, 


Ts B ived 
nount of training that a CR has er em- 
t three primary questions? (Another secondary 


^ 
eralization of extinction cannot, unfortunately 
for lack of space.) 


" . i iri- 
The evidence to be examined. The evidence for answering the emp 


2 enel ? [ the 
quent integrative interpretations "lov! 
imarily in (a) the studies from Pa 


CR 
ditioning of dogs constitutes probably more than 90 per cent of the the 
work in the U.S.S.R., while there is reason for excluding from n 
present comparative 


with 4 
a number of Russian studies, pe nese 
closer re-examination—and some Te-analysis—of Loucks’ analysis, indicate 7 write" 
after-effects certainly occur more often that chance would warrant. However, t 3 
agrees fully with Loucks that they are not of a nature to prove irradiation. 
ably are results of some very initial differential conditioning. 


5 À preliminary analysis by the writer of 
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M us puri s Of the fonr, the study by Wickens (60) is com- 
y involving response as well as stimulus generalization and b 

the general vicissitudes of conditioning finger withdrawal. The epee 
ment by Brown (7), besides being instrumental rather than classical 
conditioning, used intensities as generalization stimuli without being 
able to control the factor of intensity per se (see later sections of the 
present article). The two studies by the Lashley and Wade technique 
(9, 29) will be utilized but might be better kept apart with respect to 
direct comparisons. On the other hand, the Pavlov, the Yale, and the 
Writer's studies have all used autonomic responses and classical tech- 
niques and have, in addition, the advantage that in the Yale studies the 
stimuli and in the writer’s studies the response were the same as in 


Pavlov's studies, the major source of evidence. 
Tue EVIDENCE FROM PavLov's LABORATORY 


Source material. Prior to 1924, most—but not all—full reports of CR 
Work in Pavlov's laboratory were published in the form of doctoral 
theses, in fulfillment of the requirements for the M.D. degree at the St. 


Petersburg Military Medical Academy where Pavlov served first as 
fessor of Physiology. These 


Professor of Pharmacology and then as Pro 
d as a rule represent, in the 


theses are usually 100 to 200 pages long an i 
Writer's estimate, about as much work as Ph.D. theses in physiology or 


psychology in a first-rate American university. Their tables, or rather 
Protocols, are very detailed, trial-by-trial presentations of magnitudes 
and latencies of salivation, and of amounts and qualities of food seeking 
motor accompaniments, for each stimulus and each dog. The writer 
knows of 49 such CR theses from Pavlov's laboratory and he has read 
most of them (two of the six reports in Loucks' analysis are such theses). 

Since 1924, however, the bulk of CR experimentation from Pavlov's 
laboratories has been published in the Trudy Fiziologicheskikh Labo- 


"ülorii Akademika I.P. Pavlova (Transactions of the physiological 
labor. atories of Academician I.P. Pavlov). The writer has in his posses- 


Sion ten volumes of these Trudy, from 1924 to 1941. These ten volumes 
(nearly all salivation in dogs). 


c i . 

entain 245 separate CR studies l : 

Studies are as a rule less extensive than those in the earlier doctoral 
d in great detail (the total num- 


t 
€ses and, moreover, are not presente 
tle over 3000). They were per- 


€r of pages in the ten volumes is a lit 
M.D. degree, but by Pavlov's numer- 
ysiologists of note in 


iet not by candidates for the 
Tesearch assistants and associates, often ph 
eir Own right. 


igi z aterial of CR 
riginally, the writer intended to analyze first the material o 
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generalization in the doctoral theses. But, unfortunately, these theses 
are not available to him this year, and he decided to confine himself to 
the data in the ten volumes of the Trudy. However, he does have im- 
pressions of the theses which he read prior to the preparation of this 
article. 

Method of analysis. Only very few of the 245 CR studies in the Trudy 
deal specifically with CR generalization—or irradiation —which Pav- 
lovians consider to have become established facts years ago- But a 
good number of the studies contain exact CR generalization data, m 
cidental to the solution of some other problem in "'cortical dynamics. 
Very often a CR experimenter, after he has formed a CR to some 
stimulus, will try out the CR with some other non-conditioned stimulus 
or stimuli, and these other stimuli are quite commonly related in some 
ascertainable way to each other and to the conditioned stimulus. The 
writer has thus gone over a few thousand tables in the Trudy, and every 
time that he noted that a CR was tried with a non-conditioned stimulus: 
he computed the magnitude of its salivation as a per cent of the saliva" 
tion to the conditioned stimulus, for the particular session and particular 
mal. In computing the percentages, account was of course taken © 
the equality of other experimental factors in the session; and to 2V?' 
the problem of differential conditioning, only the Rieck iw trials of 2 
CR toa non-conditioned stimulus were used. Moreover inasmuch as 
most experimenters from Pavlov's laboratory present - each exper. 
ment the CR history of each dog, generalization stimuli that appear? 
in such histories were excluded from the computatione 

As the data began accumulating, the writer decided to limit his task 
to four kinds of dimensional (other than intensity) CR generalization y 
three kinds of intensity generalizations, and a number of what, for lac* 
i another name, may be called inter-dimensional and inter-sen507? , ,) 
generalizations. The four kinds of dimensional (other than intensit? 
A a included (a) frequencies of beats of metronome? (4) 
iion re ibl o (c) frequencies of rhythmic tactions, au liza” 

$ of tactions. The three kinds of intensity SEP" jn- 


tions contained intensities ; ad if 
ne s of flashes of 1 i ities of bells, 2! 
tensities of whistles. ights, intensities o cR 


iza The inter-dimensional and inter-senso"* etf 
generalizations refer to generalizations between such stimuli as m 


rh the 
nomes and whistles, metronomes and lights or tactile stimuli, an 
like. ý 


B : sions? 
More specifically, data were obtained for the following conditio 


audes 

f .pistle 

. 1. All the 12 permutations of generalizations between metronomes, wh 

light flashes. and tactile stimuli; he 
x c ; to 

2. Four dimensional (other than intensity) positions, with regard 
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quencies of metronomes, tones, and rhythmic tactions, and to spatial distances 


of tactions; 
3. Three lower and three higher intensity positions, with respect to lights, 
, 


whistles, and bells. 


The dimensional and intensity positions were determined, with two 
imuli characteristics: fre- 


exceptions, from the objectively given sti 
quencies, distances, c.p.s.'s, and decibels. The two exceptions were some 


cases in which spatial distances had to be estimated by the writer, and 
some cases in which the intensities of bells and whistles were not given 
in decibels (only the experimenters' mere descriptions of “weaker,” 
4 


o 


“still weaker,” “weakest,” “stronger,” "still stronger,” and “strongest” 


being included). On the other hand, it should be pointed out that the 
between positions within dimensions and 


l, and that averaging had to be done by 
(i.e., I with I, II with II, III with HI, 


magnitude of the distances 
intensities were as a rule unequa 
combining corresponding positions 


and IV with IV). 

The writer is fully aware of some of the pitfalls of his analysis. He 
would like to point out, however, first, that the Pavlov laboratories are 
extremely routinized with regard to experimenter, design, apparatus, 
stimuli, and responses; and, second, that extreme care was taken by him 
to see that each of the nearly 700 comparisons between the CR's to the 
generalization and to the conditioned stimuli was performed under 
wholly equal conditions. Moreover, while a combined analysis may 
have masked the effects of some variables, one of these variables, the 
amount of training that the CR has received, has been singled out for 
special study. The data from frequencies of metronomes, spatial dis- 

Li 


tances of tactions, intensities of lights, and transfers from metronomes 
actionated so as to compare CR generalization 


01-300, and more than 300 reinforcements. 
bles—which the Russians maintain 
e of the animals, cortical ex- 


to lights have been fr: 
/ after 1-20, 21-40, 41-100, 1 ) 
Similar treatments of other varia 


affect CR generalization, such as the ag j 
tirpation, certain drugs, and a few more—would have been possible 


but were not considered worth while undertaking, at least for the time 
being. It is believed, though, that the alleged effects of these variables 
might well have been cancelled out, since the writer included in his com- 
bined analysis data from both supposed generalization-increasing and 


Supposed generalization-decreasing studies. 


Dimensional (other than intensity) 


CR Generalization 
ensity) CR 


i er int 
onal (other than in [ 
xm Each entry 1n the table is a 


Results for 


All the Trudy data on dim 
generalization are combined in Table 
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mean per cent of generalization of a number of determinations (exact 
number is given in parentheses), obtained from 8 to 67 different experi- 
ments. The /'s and P's of the entries in Table I—as well as of the entries 
in the subsequent tables—have all been ascertained, but their separate 
tabulation would unduly lengthen the article. The most pertinent ones 
will be taken up in the text, in connection with each finding. 


TABLE I 
DIMENSIONAL (OTHER THAN INTENSITY) STIMULUS GENERALIZATION OF SALIVARY 
CONDITIONING IN Docs. DATA FROM 67 DIFFERENT EXPERIMENTS 
IN PAvLOov's LABORATORY 
Each entry is a mean per cent of conditioned salivation to the non-conditioned 
generalization stimuli. Figures in parentheses are numbers of determinations. 


Conditioncd Steps Removed from Conditioned Stimuli 


Stimuli I T "T IV 
Metronome 61(28) 57(22) 43(16) 45(11) 
Tone 69(23) 59(19) 46(13) 42(10) 
Spatial Taction 78(18) 59(14) 63(9) 54(8) 
Rhythmic Taction 51(16) 57(13) 41(8) 44(8) 


As seen from Table I, the answers to the first two of the previously 
posed "primary empirical questions" (supra, p. 342) are unmistakable. 
Non-conditioned generalization stimuli do evoke conditioned responses: 
and these responses are smaller in magnitude than those evoked by the 


tioned and the generalization CR's are significant with P equalling Ot, 
the smallest ¢ being 17.4. However, the answer to the third question, the 
question of a CR gradient, is by no means certain. Four of the 
differences between adjacent generalization stimuli are reversals, anc 
four of the remaining eight differences are insignificant, with P equalling 
.05. Yet there would be no reversals and differences between adjacet? 
generalization stimuli would be significant if the four dimensional steps 
were reduced to two by combining the first step with the second and the thar 
with the fourth. The importance of the last statement, as well as the 
fact that two of the four reversals occurred between the third and the 
fourth dimensional steps, will be discussed in a later section. 


Results for Intensity CR Generalization 


All the data on CR generalization of intensities are contained 1n 
Table II. They are given in percentages, and separately for each 0 h 
three lower and each of the three higher intensities. Unlike Table *: 
gradients in Table II are quite consistent. There are no reversals, 2? 
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hos dm Mies between adjacent stimuli are significant with P equal- 
D» = . However, Table II shows two gradients: a descending one 
9 ver intensities but an ascending one for higher intensities. Indeed 
is very doubtful whether intensity generalizations could be at all cont 


TABLE II 
ZATION OF SALIV. 
EXPERIMENTS IN PavLov's LABOR 
d salivation to the non-conditioned 
numbers of determinations. 


STIMULUS INTENSITY GENERALI 'ARY CONDITIONING IN Docs. 
ATORY 


DATA FROM 54 DIFFERENT 


Each entry is a mean per cent of conditione 
generalization stimuli. Figures in parentheses are 


Higher Inlensity Steps 


Conditioned 
Stimuli 
I II Ill I II Ul 
Lights 7914)  69(11) 58(8) 118(14) — 128(9) 149(8) 
Whistles 68(12) 58(11)  49(9) 13713)  149(8  165(8) 
Bells 72(11) 64(9) 56(7) 124(10) — 138(8) 149(6) 


sidered true CR general and, 17). At any rate, these 
generalizations ; generis, and little ought to be inferred 
from them to generalizati intensity dimensions. Further- 
more, there is reason to ke factor, namely, 
psychological intensity resulting from summation of stimuli, masks 
true generalizations along such stimuli dimensions 2$ frequencies of 


metronomes and of tactile vibrators. 
Dimensional and Inter-Sensory CR Generalizations 
s disclosed 


about this type of CR generalization, a: 
ount. for instance, more CR 


ne metronome 
s much generaliza- 
ther four steps 
in the writer's 
de of CR generalization 
s of the generalization 
imulus dimension Or 
out for a variation 
na phenomenal 


Results for Inter 


generalization from a 
to another three steps rem 
tion from a light to a whis 
away. These results, among ot 
Opinion, upon any postulate that 
Pw varies with th 
di uh to the conditio 

imensions. A better case could perh 


wit 
With some denotative relatedness OF 
world. 

much easier t 


itis very 
d, however, that it 1$ sier t 
i onal and inter-sensory stimuli 


m one 


è It should I " o establish differential 

conditionin. am mentione d : than between stimuli 

on the s & between inter-dimens! 
ame dimension. 
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Results for CR Generalisation as a Function of the Amount 
of Training of the CR 
With the exception of intensity generalization which was already 
noted as a special case, all the data in Table IV point to the following 
conclusion: (a) that CR generalization increases in the very initial 
stages of training the CR; (b) with further training of the CR, it begins 


TABLE III 


INTER-DIMENSIONAL AND INTER-SENSORY STIMULUS GENERALIZATION OF SALIVARY 
CONDITIONING 1N Docs. Data FROM 58 DIFFERENT EXPERIMENTS 
IN PAvLOv's LABORATORY 


Each entry is a mean per cent of conditioned salivation to the non-conditioned 
generalization stimuli. Figures in parentheses are numbers of determinations. 


Conditioned 


Generalization Per Cent of 


Stimulus Stimulus Generalization 
Metronome Whistle 44(33) 
Metronome Light 38(31) 
Metronome Rhythmic taction 40(27) 
Whistle Metronome 42(21) 
Whistle Light 36(19) 
Whistle Rhythmic taction 38(20) 
Light Metronome 38(22) 
Light Whistle 39(24) 
Light Rhythmic taction 34(18) 
Rhythmic taction Metronome 41(19) 
Rhythmic taction Whistle 36(21) 
Rhythmic taction Light 32(17) 


to decrease slowly; and (c) after a great number of reinforcements, thé 
generalization may increase again. All the ¢’s between 1-20 and 217 M 
reinforcements are significant (with P equalling .01), while the /'$ T 
tween 21-40 and 41-100 and those between 101-300 and more than - 


reinforcements are significant (with P equalling .05), for frequencies p 
metronomes and for spatial distances. 


Summary of the Evidence from Pavlov's Laboratory din 
B . B i i 
A statistical analysis of the data on CR generalization ut 
67 experiments, performed between 1924 and 1941, in Pavlov's la urt | 
tory provides full and clear answers to the first, second, and a el | 
questions posed in the first section of this article but offers only a P? 


[SS 
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TABLE IV 


STIMULUS GENERALIZATION OF SALIVARY CONDITIONING IN Docs As DEPENDENT 
UPON THE NUMBER OF REINFORCEMENTS OF THE CONDITIONED STIMULUS. 
DATA FROM 67 EXPERIMENTS IN PavLov's LABORATORY 
Each entry is a mean per cent of conditioned salivation to the non-condition 
generalization stimuli. Figures in parentheses are numbers of determinations. 


Number of Reinforcements of the Conditioned Stimulus 


Types of CR 
G izali 
eneralizations 1-20 21-40 41-100 101-300 300 up 


Frequencies of Metronomes — 39(12) — 647) 5506) 4902)  58(15) 
72(10) 64(9) 61(7) 74(11) 


Spatial Distances of Taction — 54(8) 
Metronomes to Lights 31(6) 42(8) 41(5) 36(5)  39(7) 
96(10) 9901)  103(14) 10002) 96(17) 


Intensities of Lights* 


* Combining lower and higher intensities. 


and not wholly certain answer to the third question. Specifically, th 


answers are: 

1. Dogs conditioned to secrete saliva upon the application of some condi 
tioned stimulus will also produce conditioned salivation upon the applicatio 
of related stimuli with which the animals have had no previous pertinent exper 
lence. i E . me 
2. Unless the intensity of the related stimulus 1s considerably higher tha 

gnitude of the related generalizatioi 


that of the conditioned stimulus, the ma dei 
CR will be considerably smaller than that of the trained Š " * 
3. CR generalization increases in the very initial stages of training the CT 


but upon further training begins to decrease slowly, while after a large numbe 
of reinforcements (over-training) it may increase again. 

4. The gradient of CR generalization 1S very crude, co 
or three steps. : 

5. The gradient does not vary merely with | 
generalization stimuli to the conditioned stimuli along sor 
or dimensions. 


nsisting only of twe 


h the degree of relatedness of the 
me stimulus dimension 


THe YALE STUDIES 
yzed here have been performed with 
GSR as the conditioned response. Unlike 

Yale investigations have 


the 

not only the physical setups but 
of their problems. Indeed, 
Jd up as CR Ph.D. para- 
ual experimenters at 


tations than indi- 


All the Yale studies to be anal 


human subjects and with the 
the studies from Pavlov's laboratory; 
mastered, as might have been expected, 

also the design and the statistical treatments 
Hovland's three experiments may well be held D 
digms. Moreover, there is little doubt that indivi 
Yale exercise much more independence 1m IBEEKpre 


vidual experimenters in Pavlov’s laboratory- 
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Hovland’s Experiment with Frequencies of Tone (16) 


This experiment has been quoted so often as an exemplar of a lawful 
and consistent CR gradient that the writer takes the liberty to re- 
produce here, for purposes of analysis, its main, and practically only, 
table (there are two more single line tables in the article). Hovland's 
table becomes Table V in this article, and the reader is asked to examine 
it very carefully. 

TABLE V* 


GENERALIZATION OF EXCITATORY TENDENCIES 


Amplitudes of galvanic responses (in mm.) to conditioned tone (0) and to tones 25(1), 
50(2), and 75(3) j.n.d.'s removed in frequency. Each value is average of two determina- 
tions following 16 reinforcements. Subjects I-X were conditioned to tone of 153 cycles; 
subjects XI-XX to tone of 1967 cycles. 


Tonol Stimuli 


Subject —— 
0 1 2 3 

I 16.2 11:3 12.6 11.4 

H 22.4 18.1 25.4 20.9 

IH 13.5 6.7 11.2 6.3 

IV 15.5 6.4 3.8 1.6 

V 19.2 18.5 22.8 15.3 

VI 16.2 18.5 11.9 13.9 
Vil 28.7 f. 18.0 16.5 
Vill 11.5 14.1 9.6 13.8 
IX 13.8 10.3 13.4 9.9 

x 22.4 18.7 10.7 15.3 

XI 23.7 21.3 21.4 10.2 
XII 16.5 20.6 13.9 12.4 
XIII 17.7 18.4 15.2 13.7 
XIV 18.6 13.9 14.3 12.5 
XV 15.9 15.5 13.8 14.2 
XVI 18.8 12.3 14.6 9.7 
XVII 21.3 9.7 10.5 12.8 
XVIII 23.2 17.8 13.9 14.5 
XIX 13.8 16.8 9.3 11.8 
xx 19.9 12.3 6.9 15.6 
Mean 18.3 14.91 13.62 12.89 
. P.E.M. 0.57 0.64 0.8 0.49 


* Reproduced from Hovland with permission. 


As seen from Table V, the mean magnitude of the CR to the condi- 
tioned stimulus was 18.3 +0.57 (P.E.) mm., while the mean magnitudes 
of the generalization CR's—removed 25, 50, and 75 j.n.d.’s from ae 
conditioned stimulus—were respectively: 14.91 +0.64, 13.6 £0.80, anc 


*—— 


india 
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i +0.49 mm. The differences between the magnitude of the condi- 
et pes CR and the generalization CR's are fully reliable statistically, 
and so is the difference between the CR to the first and to the third 


generalization stimulus (Diff./P.E.ai«— 4.21). But the differences be- 


€ adjacent generalization stimuli, i.e., between the first and the second 
nd between the second and the third are not reliable statistically and are 


rather small. Furthermore, if we examine the records of individual sub- 

Jects, we find the following: 

ë 1. Nineteen of the 20 subjects sh 
R's to more remote stimuli greater t 


2. Seven of the 19 subjects had mean 


than the mean CR's to the conditioned stimulus; 
3. The 20th subject, who showed no reversals, hardly manifested any grad - 


ent between the second and the third generalization stimuli—respective CR's 
being 21.3 and 21.1 mm. 

Thus, not a single of Howand's 20 subjects revealed a consistent 
gradient of CR generalisation and only one of the three gradient-determining 
differences was statistically reliable. 


owed one or more reversals, i.e., had mean 


han to less remote stimuli; 
CR's to generalization stimuli greater 


Hovland's Study with Varying Intensities of Tones (17) 

Two equated groups of 16 subjects each were used. The conditioned 
stimuli were a tone of 86 decibels for one group and a tone of 40 decibels 
for the other group, while the generalization stimuli were three descend- 
ing tones 50, 100, and 150 j.n.d.’s removed from the conditioned stimu- 
lus for the first group, and three similarly removed ascending tones for 
the second group. Each conditioned stimulus and each generalization 
stimulus was tested three times, after 16 reinforcements: of the condi- 
tioned stimulus with the unconditioned stimulus of electric shock. The 
mean CR to the conditioned tone of 86 decibels was 17.85 mm. while the 
respective CR’s to the descending generalization tones were: were 
10.94, and 8.9 mm. But the mean CR to the conditioned tone o 40 
decibels was 10.3 mm. while the respective generalization CR's were: 
12.98, 14.3, and 15.75 mm. Combining the results of the two ges 
Hovland obtained a mean CR of 14.3+0.6 to the conditione |n 
and means of 13.7 X0 d 12.62+0.44 to tr three 
non-conditioned gener d states that ga com- 
bining “enables one to d it cia ned 
with the intensity effect pe” se held constant (p. 282) and be is "à 
it is justifiable because the relationship between intensity an Les 
tude of response conditione " (p. 285). This linearity ed cs 
rq vats establish ation with i ld constant 
gradient of i it neralization v d : 
is ee a oe s data indicate, very small and very unreliable 


Statistically. 
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Hovland’s Study of CR Generalization as a Function of 
Amount of Reinforcement (18) 


Hovland's findings here are so much like those from Pavlov's 
laboratory—namely, an increase in generalization in early stages of 
training the CR, then a slow decrease and then again an increase—that 
they need not be restated. The only difference between the types of 
studies is that the decrease in generalization began in Hovland’s sub- 
jects after 16 reinforcements and in Pavlov's dogs after about 4Q rein- 
forcements, à difference that is only apparent, since more reinforce- 
ments are as a rule required to form a salivary CR in a dog than a con- 
ditioned GSR in a man. 


Humphreys! Experiment (23) 


Humphreys used two methods of reinforcement, a regular and a 
special, in his experiment with conditioning frequencies of tones. With 
the regular method, used for 34 subjects, the conditioned stimulus was 
always reinforced; while with the special method, used for 20 subjects, 
only half of the applications of the conditioned stimulus were accom- 
panied by reinforcement (electric shock). One of Humphreys' three 
generalization stimuli was 5 jn.d.'s removed from the conditioned 
stimulus (a tone of 1967 cycles), and another was 15 j.n.d.'s removed. 
The third generalization stimulus, on the other hand, was removed 25 
j.n.d.'s from the conditioned stimulus in one half of the subjects. But 
in the other half it was a tone 26 j.n.d.’s removed, but the lower octave 
of the conditioned stimulus. Humphreys’ results show the mean 
magnitude of the CR to the conditioned tone to have been 3.52 mm., 
with magnitudes of 2.91 for the tone 5 j.n.d.'s removed, 2.84 for the 
15 j.n.d. tone, 2.45 for the 25 j.n.d., and 3.27 mm. for the lower octave 
— when the regular method of reinforcement was used. With the 
special method of 50 per cent reinforcement, the mean magnitude of the 
CR to the conditioned tone was 3.04 mm., while the mean magnitudes 
of the generalization CR's were respectively: 3.76, 4.19, and 3.13 mm. 
(The last figure is a combination of the CR's to the tone 25 j.n.d.'s 
removed and to the octave 26 j.n.d.'s away, no separate figures being 
given by the experimenter.) Thus, we have here an experiment on CR 
generalization that shows a reversal and very small and unreliable differ- 
ences between CR's to generalization stimuli differing widely in dimen- 
sional relatedness. Hull (21) does not cite Humphreys' study in his 
marshalling of evidence for a true gradient of CR stimulus generaliza- 


tion. Dut Lashley and Wade (29) do mention it. 


The Bass and Hull Experiment (2) 


Bass and Hull worked with spatial generalization of taction, using 
eight subjects. The generalization stimuli were spaced approximate y 


uS 
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16, 32, and 48 i iti 
Wr dai ical kanary oa the inda. spot, and each generalization 
etos d 2 times. e mean CR to the conditioned stimulus 
rons $ : mm., while the means to the generalization stimuli were 
Wegen pis d dd +0.49, 4.75 +0.395, and 3.36 0.32 mm. Four of the 
SY dot showed reversals, and the subject for whom data are 
es = our successive days showed reversals on two of the four 
Ei j* js anotan way, however, the gradient obtained by Bass and 
ete eii gam from the Yale laboratories. But, of course, it too— 
peo prs s in half of the subjects and practically no difference be- 
ween the group CR to the conditioned stimulus and to the generaliza- 
tion stimulus 16 inches away—could hardly support the conclusion "ai 
Hull that ‘This generalization gradient of reaction strength is a mono- 


irapa decreasing function of the magnitude of the differences between 
the conditiond and the unconditioned? [generalization] stimuli" (21, 
ious criticism of the Bass and Hull 


p. 133). Furthermore, there is a sert 

study, as far as the gradient is concerned, namely: their alternate 
reinforcement of the conditioned stimulus and non-reinforcement of the 
generalization stimuli permitted differential conditioning by contrasts 


to develop. 
valuation of the Yale Studies 


ilarity between the results on CR 
studies. Both provide 


o the four questions asked in the first section 
hat the Yale studies were performed with hu- 
Its subject to interpretations of 
29) and of earlier experience 
ions" (Hilgard, 14). 


Summary and E 


There is on the whole a striking sim 


generalization in the Yale and in the Pavlov 


about the same answers t 
of this article. The fact t 
Iran subjects makes their gradient resu 

habits of relational thinking" (Lashley, 
with “conventional dimensions of discriminat 
Hull's statement about the “relatively non-voluntary character of gal- 
vanic skin reactions" (21) falls short in that in generalization it is the 
consciousness of the stimuli, not only of the responses, that counts, and 
human subjects are certainly conscious of the stimuli inGSR experiments, 


and consciousness of stimuli certainly affects the GSR. 
However, while human subjects could use the devices mentioned by 


Lashley and by Hilgard, they of course need not use them, and the very 
crude and irregular gradients obtained could be construed as evidence 


that they did not, at least consistently, use them. But this may be 
answered by the argument that the "sets" in the Yale studies did not 
call for conscious and active discriminations, so that these have mani- 

i incidentally (something like 


fested themselves only occasionally and 

se 1 . . B B = . 
incidental memory” OF “premature reactions” in reaction time experi- 
! Unconditioned is a very confusing adjective here. Non-conditioned would be better. 
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ments). On the other hand, the close similarity between the Yale and 
the Pavlov results should lend support to the supposition that previous 
experience was not much operative in the Yale studies. Ultimately the 
problem seems to resolve itself into the alternative of the Yale findings 
being results of (a) relational thinking superimposed upon a true CR 
gradient or (b) relational thinking superimposed upon chance, with the 


effects of relational thinking varying possibly from zero to 100 per cent. 
RAZRAN's EXPERIMENTS 


All experiments on CR generalization by the writer were with sal- 
ivary CR's in college undergraduates. His general technique of salivary 
conditioning has been described previously (42, 44, 46) and need not be 
repeated here. All that might be added are three of its more recent 
characteristics; viz.: (a) multiple intermittent one-second presentations 
of stimuli-to-be-conditioned during single continuous cating periods of 
two to four minutes, so as to provide maximum attention for the stimuli 
and to make the entire task more “molar” and meaningful; (b) misin- 
forming the subjects about the nature of the experiment, so as to fore- 
stall disrupting subjective attitudes; and (c) varying the food and 
scheduling the sessions in late mornings and afternoons, so as to insure 
adequate psychophysiological motivation. Of the three characteristics, 
the most important one is, in a way, “misinforming the subjects" such 
as telling them that the experiment aims to study ‘‘the effects of hunger 
or satiety or digestion upon eye-fatigue" (with visual stimuli) or "upon 
car-fatigue" (with auditory stimuli) or "upon memory" (with verbal 
stimuli), and actually administering some "sham" tests of fatigue or 
memory. Thus, the writer's subjects are very attentive to the stimuli, 
the responses, their own physiological and mental states; but wholly 
unaware of the experimenter's objective and his attempt to condition 
them. Their "consciousness" and "sets" have been "naturally" di- 
verted. (Occasionally a subject will "catch on,” and then his results are 


treated separately.) 
Experiments with Sensory Stimuli 


Two types of experiments were performed. In one experiment, used 
with 32 subjects, the conditioned stimuli were the flashes of two to ten 
miniature spherical lights (3 inches in diameter, 2 C.p., 2.5 volts) of 
the same or of different colors. The lights were arranged in different 
spatial patterns—many of the patterns being the same as the dot = 
rangements of Schumann, Rubin, and Wertheimer—and they n. 
flashed either simultaneously or in specially selected tempora 


quences Their generalization stimuli were (a) fewer lights, (b) lights 
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differing i i in di z 
lights = dun: (c) lights in different spatial arrangements, and (d) 
n cerent temporal sequences. In the other experiment, used 
with four subjects, the conditioned stimuli were 12 musical itor vul 
p enun due al minor seventh) in the cc! octave of a Stoelting 
a i c ien The generalization stimuli were different musical 
e. = same octave and with one common tone, different 
acre io in the same octave but with no common tone, the same 
bins n saree s in a different octave and in a different key, and the 
1e music al intervals in a different octave but in the same key. 
l'here was no attempt in these experiments to plot CR gradients but 
to a) establish the existence of CR generalization, in general; b) study 
its dependence upon amount of reinforcement of the CR; and partic- 
ularly c) study the dependence of CR generalization upon (1) the spatial, 
temporal, and color patterns in the case of lights, and upon (2) common 
ratios, partials, and absolute tones in the case of the musical intervals. 


The results showed: (a) unmistakable evidence for the existence of CR 
esser magnitudes of generalization CR's, (b) 
the course of CR generalization to the light 
ciples of patternization" of Wertheimer 
Pavlov, (c) the determining roles of 


cal intervals. 


(excluding the natur; 


generalization and for the | 
impossibility of predicting 
patterns from either the "prin 
or the simple CR generalization of 
ratios in the CR transfer from musi 
hic Generalisation (47, 49) 

The conditioned stimuli were single 
lization stimuli were words related to the 
d spelling, a relatedness named by the 
CR gradient was found here. Thus, the 
nes (urn— "earn" style— "stile"; surf 
c got: freese— "frieze") was 37 per cent, while the generalization 
from flower to "glower" was 35.1 per cent, from dark to “mark,” 31.6 


per cent, from mock to "dock," 27.2 per cent, from flower to “shower,” 
“may,” 19.6 per cent. The t's here are 


20.2 per cent, and from day to : 
significant with P equalling 05, if the differences are more than four 
per cent. 


Phonetograp 


_ Four subjects were used. 
English words, and the genera 
conditioned words in sound an 
writer phonetographic. A crude 
mean generalization to homopho 


ation (49) 


the conditioned stimuli were 
were word related 


contrasts, coordi- 


Semantic Generalis 
_ Four to 11 subjects were used, and 
single English words. The generalization stimuli 
to the conditioned words semantically; synonyms, 
nates, supraordinates, subordinates, whole-part's, part-whole's, and so 
forth. The mean generalization to the synonyms was 59 per cent, while 
the generalization to the other word categories was the greater the higher 
the frequency of such word categories în free association tests and the faster 
their reaction time in controlled association tests (61, 32). However, this 
ditioned and 


relationship held only if the relatedness between the con 


— 


> 
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the generalization words was classified according to “relatedness of 
conditioned words to generalization words," and not as a “relatedness 
of generalization words to conditioned words”; e.g., only if a conditioned 
word of dog" and a generalization word of “animal” or a conditioned 
word of "flower" and a generalization word of “petal?” were classed 
respectively as subordinates and as part-whole's, and not as supra- 
ordinates and as whole-part's. The significance of this finding will be 
discussed in a later section. 


Effects of Mental Sets upon Semantic CR Generalization (50) 


This study was largely an extension of the one preceding. Before 
being tested for generalization to various word categories, nine subjects 
were divided into three equal and equated groups. Group Ci practiced 
corresponding word categories in controlled association tests, C£» 
practiced supraordinates before being tested for CR generalization to 
supraordinates. Group C: practiced converse word categories, €.g., prac- 
ticed subordinates before CR tests for supraordinates, while Group Cs 
was not subjected to any such preliminary practice. Asa result,Group 
Cs, the control group, showed a mean CR generalization of 34 per cent. 
Group Ci that practiced corresponding word categories manifested a 
mean generalization of 54 per cent, and Group C2 that practiced con- 
verse categories, a mean of 25 per cent. The t's here are significant 
for all three group differences with P equalling .01. 


Effects of Cognitive (Knowledge of Stimulus Relations), Voluntary- 
Facilitatory, and Voluntary-Inhibitory Attitudes 


Twelve subjects, divided into four equal and equated groups, were, 
first, conditioned to the tone Cona Stoelting just harmonium and to the 
word flower ; and, then, tested for generalization to (a) tones F, B, e, @, 
dug. f#, and to (b) words “four,” "glower," "shower," and 
“scour.” The cognitive attitudes for tone generalization were induced 
by the instructions of: "When you hear a tone in this session, try to 
think of its relatedness to the tone with which we experimented in the 
last two sessions. Try to think whether the tone that you will hear is 
higher or lower and how much higher and how much lower than the 
tone with which we experimented.” The voluntary-facilitatory attitudes 
were instilled by: upd like to see how well you can control reactions. 
You were conditioned to secrete saliva to a low tone on this instrument 
[conditioning explained at some length]. From now on, you will hear 
eight higher tones, each about two or three musical intervals higher than 
the other, and it is your task to secrete most saliva to the tone closest 
to the conditioned tone—less, though, than to the conditioned tone 
itself —and least saliva to the tone most removed from the conditione 
tone [all eight tones are now demonstrated]. In other words, I would 
like you to build up a saliva scale that will correspond as closely as 
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ure tie scale of tones here.” The instructions for the voluntary- 
mode: E differed from those for the voluntary-facilitatory 
Un E nat dat the subjects were told to "secrete an equal amount of 
for "lm 1 tone irrespective of its pitch or quality.” The instructions 
Eni ptem icm were similar to those of tone generalization 
sien aen the subjects were also told "not to pay attention to word 
1 i meaning but only to that of sound and spelling." Finally, 

was a control group whose subjects were not given any instructions 


about the generalization. 

E... a dee are presented in Table VI. Space forbids a detailed dis- 

presen Tg the table. But a summary statement can be made: within the 

euin of the present experiment cognitive and voluntary attitudes modified 
ily a little the subjects! gradients of CR generalization. 


TABLE VI 
KNOWLEDGE OF STIMULUS RELATIONS), VOLUNTARY- 
FACILITATORY, AND VOLUNTARY-INHIBITORY ATTITUDES UPON STIMULUS 
GENERALIZATION OF SALIVARY CONDITIONING IN HUMAN SuBJECTS 


r T he subjects were 12 college undergraduates and the conditioned stimuli were (a) the 
we C on a harmonium and (b) the word flower, Each entry is a mean of six determina- 
ions, two from each of the three subjects. VF =voluntary-facilitatory attitude, VI= 


voluntary-inhibitory, KR =knowledge of stimulus relations, UI —uninstructed. 


LerECTS OF COGNITIVE ( 


I 


Per Cent of Generalization 
Induced Generalization Stimuli Reliable 
Altitude Tones Gradient 
Steps 
F B e a d go oe JE (estimated) 

UI s o s @ S X9 T 38 Three 

VF a 2 44 » 9» r a0 M Three or Four 

VT zs om 6 m 5 5 i2 48  Twoor Three 

KR Q s so n 9 40 32 3l Three or Four 

Words 
Flour Glower Shower Scour 

GI 39 29 32 19 Three 

VF 62 68 52 23 Three 

MI 68 72 60 48 Three 

ER 54 4l 45 26 Three 


Transliterated Russian Words 
with Russian, were divided into three 


conditioned to four transliterated Russian 


T, and SMESHNOY. Group N was 
G was told the meanings of 


The Experiment with 


a Nine subjects, unfamiliar 
eis and equated groups and 

ords: BUKVA, DOLGO, KUPI 
not told the meanings of the words. Group 


y 
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the words before being tested for generalization, while Group CG was 
given one set of meanings before the primary training of the CR, and 
another reverse set of meanings before the generalization tests (having 
been assured that the second set was the correct one). The subjects in 
Group N did not, of course, show any significant semantic CR gen- 
eralization. But the important point brought out by the experiment was 
that the course and range of the semantic generalization was deter- 
mined entirely by the meanings given to the Russian words before 
the generalization tests. The meanings imparted prior to the original train- 
ing of the CR had no effect upon the subsequent semantic generalisation of 
the CR to the verbal stimuli. 


Summary of Rasran's Results 


Besides corroborating in the main the findings of the Yale and the 
Pavlov laboratories (as analyzed by the writer), the writer's results 


indicate that: 
1. With human subjects, gradients of CR generalization, though very crude 
ones, are more evident with verbal than with sensory conditioned stimuli. 

2. With musical intervals as the conditioned stimuli, common ratios deter- 
mine much more the course of CR generalization than common constituent 
tones, either fundamentals or partials; with flashes of lights as the conditioned 
stimuli, the spatial patterns of the lights are more determining than the colors 
of the lights in the patterns; and with verbal conditioned stimuli, relatedness in 
meanings is more significant than relatedness in sound and spelling. 

3. Mental sets are effective in determining the general direction and magni- 
f CR generalization but do not seem to be of much significance— without 


tude o l : j 
eating or in counteracting the gradients of the generaliza- 


special training—in cr 


tion. 
4, The course and very existence of CR generalization is more likely a func- 


tion of the subsequent testing for generalization than of the original training of 


the conditioning. 
Tug TorAL EVIDENCE AND THE LASHLEY-WADE INTERPRETATIONS 


Total evidence adduced heretofore is, in the writer's estimate dis- 
cordant with either (a) the views of Pavlov that CR generalization isa 
function of the undulation of cortical excitation or (b) the iowa of Hull 
that the generalization gradient is a monotonic decreasing funetion of 
the magnitude of the differences between the conditioned ‘and the gen- 
eralization stimuli. But the possible concordance of the evidence with 


" " : -— 

a ie study by the w riter (50), the CR generalizations of four subjects, who were 

€ he controversial sentences such as "socialism is desirable," scemed to be 

ni 2 soti politen) views of the subjects, while the conditioning itself showed 
z: aces. How ever, these results were obtained from a very small number of 

subjects and must be considered very tentative 
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he Lasl f erpretations—out ned ir 
shiey i i i 

iis i: ley and Wade int P 1 li i icti 
eserves careful examination. 


Generalisati j jati 
eralisation as a failure of association. There is no doubt in the 


writer’s mi : i 
mind that a good deal of presumed evidence of CR generaliza- 


tion is due ae iA 
to a failure of association. To Lashley and Wade's in- 


Sightf EXEAT 
8 i Pons eget he would add the following experimental support: 
AN sn poser dox. vite am Pavlov's laboratory (24, 27; 37, 53, 62, 
taetlon, lett and biu p ee ieee = compound stimulus—light and 
Somali fared te F 1 + di eren soun s or lights or tactile stimuli—had 
MR ien produce a CR, when only one of the stimuli, supposedly the 
od ne was presented alone. There was no generalization to these compo- 
alin leer p stimulus, even though they were physically present dur- 
stile ae TU CR training and even though they obviously were physically 
terpretate ne qoa stimulus (partial identity.) And the only sensible in- 
hee Falls this phenomenon would seem to be that the animals failed to 
the T € particular component stimuli with the total CR situation, that 
ieee uli were not attended to," were present but ineffective. Pavlovian ex- 
es ee that in such cases the weaker stimuli becomeinhibited by the stronger 
Wee ntradict their own doctrine that all aspects ofa stimulus are associated 
inb 1e conditioned response. Or, to put it differently, if weaker stimuli are 
vel by stronger stimuli during primary conditioning, why do not the 
er generalization bonds become inhibited by the stronger conditioned 


bonds, and why is there generalization altogether? 
diit a number of experiments from Pavlov's 
B great rà badly injured or specially drugged dogs ar e 
describ vi of CR generalization. And a widening of generali 
these ed in old dogs, young puppies, and in lower animals. 
ge "puse associative capacity is no doubt reduced, an inter; 
neralizations in terms of failure of association seems very reas 


laboratory, partially decor- 
e often reported to manifest 
zation is also often 
Inasmuch as in all 
pretation of their 
onable. 


However, against these two considerations the writer would pit four 


Others vi 
t vIZ.: 
old dogs have also been reported 


and injured, and i 
greatly the scope of their general- 


1. Partially decorticated 
having reduced 


by P 

k avlov and hi es 

ization. is students a: 

the » The frequent increase of CR generaliz 
R—which the writer believes to be an es 


Wi = s 
ith a “failure of association" interpretation. uu , 
3. Failure of association could hardly apply to the generalizations obtained 


by Grandine and Harlow (9) and by Grice (10) who used a Lashley-Wade tech- 


nique, 
ex oa Failure of association could not explain the generalizations in the writer’s 
Periments in which the subjects attended very minutely to the stimuli, and 
ions in other experiments with human sub- 


Probably t 
co not too well the generalizat 


stages of training 


ation in the initial 
does not go well 


tablished fact— 


of association may account, in the 


In other words, while failure 
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writer's estimate, for a good deal of reported CR generalization, it does 
not explain al] CR generalization reported. Probably Lashley and 
Wade did not themselves mean it to do so. 

Generalization as a product of variable stimulus thresholds. Again, it 
seems well probable that some data on CR generalization are to be at- 
tributed to variable stimulus thresholds. In spatial generalization of 
taction, Loucks (30) actually demonstrated what he called peripheral 
mechanical irradiation by stimulating a point not too far from the 
conditioned spot with a “pricker” obtained from Pavlov's laboratory. 
And the writer pointed out earlier in this article the probable masking 
effects of intensity in CR generalization. Still, there is a limit to variable 
thresholds, and the writer doubts very much whether such a concept 
could be of any value in interpreting CR generalization to stimuli far 
removed from the conditioned stimulus, inter-dimensional and inter- 
sensory generalization, generalization by the Lashley-Wade technique, 
and generalization in the writer's experiments. 

Generalization along discriminated dimensions. The main argument 
here is the Lashley and Wade statement, quoted earlier, that “Tt [true 
generalization] does not occur in conditioning to a single stimulus but is 
somehow a function of differential training with two or more stimuli 
on the same dimension" (p. 82). This apparently mcans that, according 
to Lashley and Wade, whatever CR generalization cannot be explained 
by “failure of association” or by "variable stimulus thresholds," its 
cause must be sought in the reactional biographies of the stimuli in- 
volved, a laudable but difficult matter. To be sure, reactional biog- 
raphies are very important, and the writer could point to the experi- 
ments by Prokofiev and Zeliony (38) and particularly to the one by 
Brogden (6) on sensory pre-conditioning which show the interactions of 
sensory stimuli in animals without even reinforcement; as well as to the 
effects of previous experience on CR gradients in his own experiments. 
Yet in the 67 studies from Pavlov's laboratory which the writer analyzed 
he ruled out results with dogs that had had, as far as could be ascer- 
tained, some previous experience with the generalization stimuli. And 
in the writer's own experiments, discriminative sets modified but did 
not wholly change the character of the subjects' CR gradients. Further- 
more, on general grounds and analogies, Lashley and Wade's total denial 
of the existence of a true single-stimulus absolute value generalization is 
xot wholly borne out by their own statement. They state: “ 
2p- v for relations is much more permanent than memory for ab- 
solute Pi5perties" (p. 86), which to the writer means that transposition 
is more Significant than generalization, a view with which the writer 
tends, in a large way, to agree (cf. 44). But to the writer the statement 
also means an admission of the existence of memory for absolute prop- 
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erties and, by analogy, the existence of a true single-stimulus CR 
generalization.? 

THE Writer's OWN INTERPRETATIONS 
f CR generalization rest upon 
“subsequent testing” hypothe- 
“rating scale” type CR gradi- 


The writer's own interpretations O 
three considerations: a bifactoral theory; a 
sis; a doctrine of a qualitative, categorizing, 
ent. 

The bifactoral theory. 
maintains that there are two 
generalisation and (b) true general 


The bifactoral theory of CR generalization 
kinds of generalizations: (a) pseudo- 
ization. Pseudo-generalization, well 


accounted for by Lashley and Wade's interpretations, correlates, for 
the most part, negatively with organismic capacity. It predominates in 
the CR generalizations of lower animals (fish, guinea-pigs), in partially 
decorticated or very young or very old higher animals, in states of 
fatigue and inattention, and in some stages of CR training (initial, over- 
training). True generalization, on the other hand, is a positive capacity 
of the organism, an ability to generalize absolute characteristics of 
stimuli or objects. It seems to increase with maturity, alertness, post- 
Operative recovery from cortical extirpations, administration of some 
drugs, and other capacity-enhancing influences.'? Analogically, perhaps, 
the relation of pseudo-generalization to true generalization 1s not unlike 


that of the undifferentiated total action of the young foetus to the 
structured whole activites of the fully developed individual. — " 
The "subsequent testing" hy- 


The subsequent testing hypothesis. n 
Pothesis asserts that CR generalization develops, not during the 
but during the subsequent 


original training of the conditioned stimuli, b : 
testing of the generalization stimuli. The writer agrees here fully with 


Lashlev and Wade that "there is no ‘irradiation’ or spread of effects 
during primary conditioning" (P- 74) and disagrees with the assump- 
tions of Pavlov and Hull and Spence that during original CR training 
Some sort of generalization bonds, latent but later ready to function, are 
automatically developed. He believes that such assumptions are o 
only cumbersome physiologically and superfluous logically, but also 
? The writer agrees also with Lashley and Wade that the “basic nervous mechanism is 
9ne of reaction to ratios of excitation” (p. 86). But he believes that some ratio of excita- 
tion is also involved in simple conditioning, à ratio between the reaction-patterns of the 


condition SS = ili 
ed and the unconditioned stimu. . . 
ie to be able to discuss here the evidence for these assertions. 

fully analyzed yet, 


A er is that the evidence has not been r Á 
n f reading Russian 


r n impressions © 
e thus based only upon 
pon an earlier, not completely up to date, re- 


e Writer regrets not 
ana i son is lack of space. 
Sthitien, © these assertions ar à 

5, and in the case of lower animals u 


vi p 
Ew by the writer (41) of the topic- 
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that they are contradicted by a series of his own experiments. According 
to his view, all effects of generalization are generated during tests of 
generalization." 

The doctrine of a qualitative, categorizing, "rating scale" type of CR 
gradient. This doctrine is in a way the crux of the writer's interpreta- 
tions. It is based upon a statistical and a logical analysis of total evi- 
dence which, in the writer's estimate, demonstrates that there is a true 
CR gradient, but that this gradient is very qualitative and very crude, 
consisting of only a few steps, perhaps more steps in human beings than 
in dogs, but few just the same. Apparently, when human beings or dogs 
that have been conditioned to some stimulus or object are confronted 
with some new non-conditioned but in some way related stimulus or 
object, they categorize or rate the new stimulus on some sort of crude 
similarity-dissimilarity scale. With human subjects, introspections 
actually reveal such categorizing attitudes as "similar," "very similar," 
"not so similar," "somewhat similar," “dissimilar,” “very dissimilar," 
and the like—attitudes that apparently control or even initiate the 
generalization responses. And it is the writer's contention that some 
such categorizing behavior is operative also in animals. 

This categorizing is thought by the writer to be very dynamic and 
changeable and varying much more with the organic dimensions of the 
organism than with the physical dimensions of the external stimuli in the 
CR situation (data deny significant variations with the latter). Its 
neurophysiology is regretfully obscure, as regretfully as the neuro- 
physiology of learning in general.” But it should not be difficult to con- 
ceive the organic dimensions with which it might correlate: perhaps the 
movement-produced stimuli of Guthrie, or the situation-sets of Wood- 
worth, or even the means-ends-capacities of Tolman, At any rate, 
qualitative step-like CR gradients and only such gradients are objec- 
tively demonstrable. So that those of the writer's colleagues who 
fear anthropomorphism more than zoomorphism need accept only the 
re of gradient and not its human analogy. While to others, rat 

o less welcome than hypotheses in rats. 


GENERAL SUMMARY 
Four views of CR generalization have been considered: 


t Wh, A (Pavlov, Bekhterev): CR generalization is a func- 
n of a wave-like irradiation of cortical excitatior. It posi itati 

i ; " tsa con- 
tinuous CR gradient. = ane 


2. Physico-behavioral (Hull, Spence): CR generalization is a monotonic de- 


1 [t might be worth consideri imi i 
ing a similar hypothesis for tra iti re 
complex transfer phenomena. - sie a 
12 The writer is much more in accord with Loucks than with Skinner and Hull about 
the need and value of correlating behavior with neurophysiology; but he does defer to 
Lashley about the paucity of our knowledge in this area. 
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creasing functi A = 
at m = of the magnitude of the differences between the conditioned 
mus d he generalization stimuli. It, si itati i 

ous CR sans aie , too, posits a quantitative continu- 
3. Fa x = "1 H my 

afi ui of association-trans position (Lashley and Wade): some reported 

anenai R Gee are due to failure of association; others represent 
spositions of discrimina i i i <i 

Sten Vern. ted dimensions. It denies the existence of true CR 
4. Categorisi j "hi i 

wie ee (Razran): While recognizing the roles of failure of as- 

pe aes and transposition, it affirms the existence of a true CR generalization 

ane attributed to the organism's categorizing or rating of related stimuli 

sort of crude similarity-dissimilarity scale. It posits a very crude and 


qualitative CR gradient. 


Adduced evidence favors the fourth v 


1 p r 
nent with the first two views.” 


iew and is clearly in disagree- 
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PSYCHOTHERAPY AS A PROBLEM IN 
LEARNING THEORY! 


EDWARD JOSEPH SHOBEN, JR. 
State University of Iowa 


It has become increasingly apparent that clinical psychologists are 

more and more drawing psychotherapy into their compass of activities. 
If this enlargement of scope is to be something more than a trading 
of one's psychological birthright for a share of psychiatric pottage, it 
would seem imperative that the therapeutic functions of the psycholo- 
gist be regarded from the point of view of research as well as from that 
of practice. As Sanford (47) puts it, 
What should be of great help to us here is our training in scientific method and 
our tradition of research-mindedness. It would be hard to name an area in 
which research is more needed than it is in therapy, or an area in which what is 
being done lags further behind what might be done. . . . And one might say, 
furthermore, that it is primarily up to the psychologist to perform this needed 
research. 


The difficulties in the way of such inquiry, however, are enormous, 
as is well attested to by the paucity of investigations of the therapeutic 
process in terms of the problems, techniques, and concepts common to 
general psychology. The nature of some of these barriers to psychologi- 
cal research on a matter of such importance probably merits some brief 
attention. 

In the first place, there are situational deterrents to research in 
psychotherapy. Counseling? usually takes place in a “service” setting 
and is consequently seldom subject to the kinds of exact manipulation 
required by rigorous experimentation. Often, attempts to control vari- 
ous factors in the therapeutic set-up give rise to serious ethical problems 
concerning the relationship of the therapist and his agency to their 
clients, and certainly the pressure of the demand for counseling services 
frequently conflicts with the requirements of a research program. Sec- 


1 This article represents a revision and extension of an earlier attempt (51) to con- 
ceptualize psychotherapy in terms of systematic behavior theory. Acknowledgment 
must be made to a number of people, foremost among whom is Dr. O. H. Mowrer, who, 
though he may recognize some of his ideas in the ensuing pages, must not be held re- 
sponsible either for their form or for the uses to which they are put. Others are Dr. 
Kenneth Spence and Dr. I. E. Farber, of the University of Iowa, who have been in- 
valuable sources of stimulation and instruction but who are absolved from any re- 
sponsibility for what is here said. 

2 The terms counseling and psychotherapy are here used interchangeably without 
regard for any of the distinctions they are sometimes employed to convey. 
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ondly 
En m Uy e of complexity gives one pause. Psychotherapy is a 
ERE E : interaction, an active social situation, in which many 
esa cu E-be-ibblate aspects of the personalities of both patient and 
e es ban be taken into consideration. The therapist is not merely 
"im ied some supposedly meliorative technique but is deeply in- 
hoa a Vien dod in the counseling process. Thus, the psychology 
idi I 3 chologist, as well as the psychology of the patient and the 
of the therapeutic method, enters into the determination of the 


thera A. eus 
rapeutic end product. Third, there are personnel problems militating 


against effectiv . 
gainst effective research in psychotherapy. Psychologists most fa- 


miliar with the therapeutic process are seldom well schooled in the ex- 
b ai MS conceptual skills basic to fruitful investigations in gen: 
eed. c e ogy, whereas those who are best equipped technically and 
raS E ly as research workers are generally rather untutored in 
mate sins are unfamiliar with clinical material, and are 
re Ta ly repelled by the admittedly gross and somewhat nebulous 

ions clinicians use in their efforts to conceptualize the complex phe- 
nomena with which they work. In sum, the situational lack of amena- 


bility of psychotherapy to experimental inquiry, the enormous com- 


plexity of the factors entering into the counseling process, and the dif- 
TE in training and interest between clinical and laboratory workers 
tend to impede a rapprochement between psychotherapy and the re- 


search functions characteristic of general psychology. 

In spite of these difficulties, there is one slender lead that might be 
ies iyd followed in the attempt to provide a basis for the concep- 
ualization and investigation of psychotherapy as a problem in general 
Psychology. This is the widespread recognition that psychotherapy is 
essentially a learning process and should be subject to study as such. 

, This point of view is not only in harmony with the general concep- 
tion of counseling as a conversation or series of conversations between 
two persons, therapist and patient, the goal of which is to resolve the 
Conflicts, reduce the anxiety, or somehow modify the behavior of the 
ele conception which clearly implies learning; it has been more or 
ss clearly so verbalized by a number of clinical workers. i Cameron (4) 

ces the desideratum of counseling as the patient's "acquisition of nor- ~ 


mal biosocial behavior,” a statement which definitely implies the learn- 
ý H 
on of the therapeutic process. 


aL of new ways of reacting as a functi ‘ : ic r 
t xander and French (1) advance as a basic therapeutic principle the 
exposure of the client, within the favorable circumstances of psycho- 
therapy, to emotional situations with which he was unable to deal in 

for such a reexposure rests on 


th LAS : 
€ past. Presumably, the justification 
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the hypothesis that its occurrence "under more favorable conditions" 
in some way permits the patient to learn more adequate ways of coping 
with such experiences. Rogers (45) describes the therapeutic process as 
a freeing of the "growth capacities" of the individual which permits 
him to acquire "more mature" ways of reacting. If "growth" in this 
context means (as it must) something more than physiological matura- 
tion, and if it is not to be lumped with the old and rather mystic homeo- 
pathic notion of the vis medicatrix naturae, it must refer to the client's 
acquisition of new modes of response. Such new modes of response are 
“more mature” because for a given patient they are less fraught with 
anxiety or conflict. Thus, Rogers is actually talking about psycho- 
therapy as a learning process. White (64) insists that, "Psychotherapy 
is designed to bring about learning... "; and Darley (7) argues that 
unless the process of learning in counseling is demonstrated, it is not 
legitimate to infer that the modifications of behavior that may occur 
during or following therapy are necessarily outcomes of therapy. 

In spite of this widespread acknowledgement of psychotherapy as a 
learning process, there have been few attempts (11, 25, 49, 50) syste- 
matically to formulate therapy in terms of learning theory. This paper 
represents a tentative, apologetically offered effort to construct a 
learning-theory interpretation of counseling that will help to narrow the 
gap between practitioner and researcher, clinician and experimentalist, 
and to encourage some much needed investigation. 


CoMMON FACTORS IN ScuooLs OF PSYCHOTHERAPY 


When one surveys the various theories and practices of psycho- 
therapy in an effort to find those common factors which a learning- 
theory interpretation of the counseling process must cover, it appears 
possible to make four summarizing general statements: 


1. All schools of psychotherapy can with some justice claim cures (46). Not- 
able successes seem to be the common property of virtually all forms of counsel- 
ing from moral suasion through non-directive therapy to psychoanalysis. 

2. Clinical patients, in spite of their enormous differences, tend to present 
oh ccna des ee in that one of their primary motivations is anxiety and much 

ir non-integrative or "symptomatic" behavior is maintai i 
of anxiety reduction. Fur en si basia 
f 3. „The goal common to most psychotherapies is the modification of the 
client s underlying anxiety. This is related to the hypothesis that once his moti- 
vation is altered, the overt habit structures of the patient will change. 


à itus -— " r " . 
The mn e spoken of in this paper include only those classifiable as 
neurotic or "mala justed. Nothing said here is meant to apply to psychotics, psycho- 
paths, or behavior problems associated with endocrine disturbances or lesions of the 
central or autonomic nervous systems. 
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4. Finally, all types of counseling employ the techniques of the therapeutic 
relationship, the unique social situation that is formed when therapist and pa- 
tient meet to discuss the problems of the latter, and of conversational content, 
that is, of talking about certain things within the therapeutic setting rather than 


others, 


A word must be said about each of these four factors which seem to 
be common to the various forms of counseling, regardless of the doc- 


trinal banners flown. 


If it is true that the proponents of various 


All schools report cures. 
able to claim successes, and if it is 


theories of psychotherapy all seem 
true—as has often been pointed out—that successes are no proof of 


therapeutic theory, then it would seem to follow that an understanding 
of the counseling ‘process would be furthered by giving more attention 
to the conditions under which the patient's learning of new modes of 
reaction takes place within the general clinical setting. If this is a fair 
notion, based as it is on the conception of therapy as a learning situa- 
tion, it might be instructive to explore the points in common among the 
different approaches to counseling in terms of (a) the similarity of 
patients’ problems, (b) the agreement among clinicians as to goals, and 
(c) the techniques common to nearly all therapeutic enterprises. Such 
an exploration might lead to a formulation of the learning process in 
counseling in terms of these three sets of information. | . 

Similarities in clinical cases. While from the practical standpoint of 
dealing therapeutically with patients it is necessary to consider each 
case in all its uniqueness, from a theoretical point of view it Is instructive 
to look for similarities. This amounts to asking the rather ambitious 
Questions of (a) What constitutes the core of “neurosis s. malad- 
justment’’? and (b) What are the common problems faced by t pie im 
in their contacts with patients? While no definitive answer lg pem n 
here, it is important to consider these issues as bearing on - goals s 
techniques employed by counselors of different hearen — s 
Le as factors to be accounted for in attempting to formulate a learning 
theory interpretation of the therapeutic process. f "WP 
H A pom pe which there seems to be widespread A all neuroses 

orney’s (17) phrase, that "one essential a, ng 

-is din oo Moni the defenses built up against them. te even 
anus feelings are all variants of anxiety m TT ise, it would seem 
dal vs expectations f future DOR (6) and Rogers (48) 
“Tat the “phenomenological 5€ 707 77 BAS DTE. for idc 
refers ; atient's level of anxiety, guilt, 1 
Ubrum eee ^ erbalizations, accurate or otherwise, of his 


efenses agai ue 
“ses against them. TE cri 
To conceptualize anxiety usefully, 1t 5 r f rtt neurotic 
tween anxiety and fear or, as Freud (12, 13, 14) did, be 


minate be- 
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anxiety and objective anxiety. Fear may be thought of as an affective 
reaction proportionate to some external danger. Anxiety, on the other 
hand, differs from fear in at least two ways. First, if one asks a "'neu- 
rotic" patient what he is afraid of, he will admit to being afraid but will 
generally have no idea of what the source of the possible danger might 
be. Anxiety may be aptly termed either a fear of “nothing” or a fear of 
something which is objectively irrelevant. Sccond, while both fear and 
anxiety are anticipatory states involving some kind of premonition of 
danger, the signal to which anxiety is a reaction is usually internal, some 
impulse to act in a Way that has been forbidden. An illustrative case 
may clarify this point. 

E. B., a 24-year-old male undergraduate veteran, despite slightly better than 
average academic ability, is making poor grades and is in danger of being dis- 
missed from his university. He complains of being "unable" to study, feclings 
of inferiority in social groups, and serious doubts as to both his intellectual and 
social adequacy. He has some guilt feelings about having transferred from a pre- 
medical curriculum to English, because his parents are quite eager for him to 
become a physician. His father is a farmer who.has been quite successful finan- 
cially and in community politics, and who has been highly ambitious for his 
son. He has imposed very high standards of attainment on the boy, has been 
quite strict and stern with him and has had a number of set ideas which he felt 
that the youngster should accept and act upon “for his own good." Any devia- 
tion on the part of the patient from the parentally prescribed ways of doing was 
met with severe punishment, the verbal part of which usually consisted in a 
variety of changes rung on the theme of the boy's worthlessness and a series of 
predictions that he would come to no good end. In short, any self-initiated 
activity— behavior which the parents themselves did not lay out— was fraught 
with danger. When the boy began counseling, he was squarely on the horns of 
a dilemma: unable to meet parental demands for a variety of reasons, he was 
also unable to initiate any divergent plans of his own without experiencing a 
flood of anxiety, i.e., anticipations of parental punishment. 


This, if it is acceptable, leads to a general formulation of non- 
integrative or neurotic behavior. Anxiety has repeatedly been shown to 
have drive properties (29, 32), and on the basis of the anxiety drive, in- 
dividuals who are maladjusted seem to develop various overt reaction 
patterns that become stable according to the degree to which they 
reduce the anxiety. This statement in terms of con temporary reinforce- 
ment theory (19) is quite in keeping with Freud's (12) idea of the inter- 
changeability of anxiety and symptom, by which he means that through 
the formation of symptoms the patient protects himself from anxiety 
attacks. Anxiety is allayed by some anxiety-reducing symptom; if 
the symptomatic behavior is somehow eliminated, the anxiety returns. 
On the basis of this notion it is possible to define a neurosis or a malad- 
justment in terms of behavior which serves to reduce anxiety directly 
without altering the conditions which produce the anxiety. Freud con- 

sistently refers to anxiety as a signal of impending danger; the malad- 
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Justed person is one who cith i 
adhue er consciously or unc i i 
D dian eerie or neutralize the rast deem a 
ine d cled, by is in the position of the motorist who shuts his 
aes dea 2 dangerous curves, thus protecting himself from 
ae iuc liable to serious accidents. 
fiis gacctece ~~ gs an explanation of the curious observation 
Sr Chee. se us behavior is at the same time self-defeating and 
HT M (4 dne fae t is self-defeating in that such behavior leads in- 
Meroe calla - piinishment: the motorist has accidents; the il- 
rouer his sie un ers ME failures and social disarticulation 
pi a manes ol study to protect himself from the anxiety en- 
alib» qu PR rehenes activity and his withdrawal from social 
EM seb ire his worthlessness. It is self-perpetuating because of 
ru s ae meinfbreemènt derived from anxiety reduction. Since the 
tienes a reinforcing state of affairs lies on the temporal gradient 
es foe — in greater proximity to the anxiety-reducing behavior 
aaa s the more remote punishment, the connection between the 
al and internal cues of anxiety and the non-integrative response 


tends to be strengthened (38). 
a MD concept in a theory of anxiety is that of repression. This 
iis pe to the exclusion from communicability (consciousness) of 
a hila se to act which has led to punishment. When a parent punishes 
Rea tn for some tabooed act, the impulse to commit such an act 
P gom its association with the punishment, a stimulus for 
mmis ne way by which the anxiety may be avoided is through 
ilem i on—the exclusion from awareness of the impulse. If the repres- 
Ci ri ai there is a thorough-going allaying of anxiety, and the 
Iden impulse no longer constitutes a problem. 
eid arises because repression is seldom if ever complete. The 
which ual is constantly threatened by “a return of the repressed” (14) 
the touches off anxiety without the patient s being able to verbalize 
or cues for it. In short, the repressed impulse, although excluded from 
Pel cra is still operative at subliminal levels. Why this 
Boe d be true is something ofa psychological mystery, although some 
San is shed upon it by investigations of punishment. Estes (9), for 
not ple, by a series of experiments has shown, that punishment does 
extinguish a response which has been positively reinforced. He 


concludes, 
m's repertoire more rap- 


idly à response cannot be e 1 
With the aid of punishment than without it. In fact, severe punishment may 
The punished response continues to exist 


avi 5 
i precisely the opposite effect. -.- "18 P igi fo exis 
organism's repertoire with most of its original latent strength. While it 1s 
t only protected from extinction, but it also may 
' or "dread," 


Sup 
"eii the response is no 1, bu 
whi x a source of conflict. An emotional state, such as “anxiety 
ich has become conditioned to the incipient movements of making the re- 


liminated from an organis 
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sponse will be aroused by any stimuli which formerly acted as occasions for the 
occurrence of the response (pp. 37-38). 


This provides a neat parallel to what is implied in the concept of repres- 
sion. 

In summary, then, one might say that clinical cases share in common 
(a) anxicty touched off by (b) unverbalized, unsuccessfully repressed 
impulses to act in ways that have met with punishment, and (c) per- 
sistent non-integrative behavior of many kinds, which reduces the 
anxiety but does nothing about eliminating its objective causes. 

Common goals in psychotherapy. In spite of its non-integrative 
nature, overt neurotic behavior acquires remarkable persistence 
through anxiety-avoidance. This persistence is probably the factor 
most responsible for the failure and consequent elimination of clinical 
techniques aimed at the climination of symptoms. Such a goal, in effect, 
defined psychotherapy as a process of robbing the patient of his de- 
fenses against anxiety without alleviating the unbearable state of dread. 
Since such an end is impossible of realization, advice, persuasion, ex- 
hortation, and suggestion have largely gone by the board in favor of 
methods which focus on the client's anxiety itsclf. 

In other words, the goal of most modern psychotherapies is the 
modification of the emotional determinants of neurotic behavior. Thus, 
Alexander and French (1) speak of therapy as “ʻa corrective emotional 
experience," which presumably results in a diminution of anxiety and a 
consequent elimination of persistent non-integrative behavior from the 
patient's repertoire. Likewise, White (64) points out that ‘‘Psycho- 
therapy does not take place primarily in the sphere of intellect . + 
Its sphere of operation is the patient's feelings.” The kind of learning 
with which counseling is concerned has to do chiefly with the alteration 
of motives and affective drives. This does not mean, of course, that the 
therapist is uninterested in his client's overt behavior; on the contrary, 
it is his job to help the patient alter it and achieve a repertoire of more 
integrative habits. But since this goal does not seem attainable through 
any kind of direct manipulation, the counselor generally works on the 
elimination of the basic anxieties, implicitly hypothesizing that once the 
drive conditions are changed, the neurotic behavior will show less 
strength. 

Common tools in psychotherapy. From the standpoint of technique, 
there are two main aspects of the counseling process, common to al 
schools of psychotherapy. One is the unique relationship that develops 
between therapist and patient; the other is the conversational content, 
what they talk about during their sessions together. The proponents 
of different theories of counseling may emphasize one or the other © 
these factors, but both figure in their final formulations of therapeutic 
procedure. Thus, Williamson (66) and Kraines (24) stress the therapist 
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obtaining personal information from the client so that the counselor may 
guide him somehow to a higher level of adjustment. In spite of this 
emphasis, both these clinicians devote a good deal of attention to the 
necessity of establishing and maintaining rapport or winning and 
retaining the patient's confidence. On the other hand, therapists like 
l'aft (60), Allen (2), and Rogers (44) play up the quality of the counselor- 
client relationship and are concerned only secondarily with the con- 
versat ional content aspect of therapeutic interviews. Nonetheless, they 
are quite insistent that the proper content of counseling contacts is the 
feelings" of the patient rather than his overt behavior or his in- 


tellectualized beliefs. 

f What is this content factor in counseling? What are the areas of 
discussion between counselor and counselee? In line with the foregoing 
(although at variance with a widespread belief among laymen), thera- 
peutic conversations are concerned with the patient's overt behavior 
only insofar as it bears on his covert reactions—the anxieties from which 

atively defends himself. 


he suffers and against which he so non-integr: 
The client's anxiety (guilt feelings, feelings of inferiority, or in- 


adequacy), then, constitutes the central topic of concern in psycho- 
therapeutic interviews. But clinicians are also interested in the oc- 
currences that engender anxiety. Especially are they interested in the 
formative past experiences* which have been associated with anxiety, 
and they encourage patients to discuss such events and their reactions 
to them rather fully. Emphasis throughout seems to be more on the 
way the client feels about his experience rather than on the objective 


accuracy of his reportage. ; . . , 
, Thus the conversational content of counseling consists chiefly in the 
discussion of the patient's anxicties and the conditions which either 


currently evoke them or seem to be causally linked in some historical 
Sense to them. 


The relationship as 


pect of therapeutic procedure has been d 
i r f -der (54), and other 
Most vigorously expounded by Rogers (44), Snyder (54) oo 


members (3, 6) of the so-called non-directive or client-centerec e 
uch a notion is, of course, by no means new to counseling technique. 


Freud (13) in stressing the idea of transference was talking about 
essentially the same thing: the basic role in psychotherapy of a 
affective bonds uniting client to counselor. In the case of orthodox 


who verbally disclaim any interest in personal history 


H i «peri Y ing to 
data, hardly prevent their patients’ discussing past — would A A i 
go systemati :es of electrically recorded non-directive in ervi 
bo atically through a series © i ean du 
if the d » i f affording a relatively complete case his ory: 
ata collected fall very far short of 2797. y compie à e ed 
a preliminary trial by the writer, using material collected from twelve sessions wit 


Case, the greater part of a typical anamnestic form could be filled out from t 


tio 
ns of the recordings. 


* Even therapists like Rogers; 


he transcrip- 
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psychoanalysis, transference refers to the displacement of childish 
attitudes from the analysand's past to the analyst, who becomes a sub- 
stitute for the important previous objects of his patient's loves and 
hates. That such things do take place in psychotherapy is not ques- 
tioned, but whether they must occur in just such a form for counseling 
to be successful may be doubted. For present purposes, it is merely 
necessary to establish the point that the relationship factor is inherent 
in the psychoanalytic approach to therapy. Cameron (4), writing from 
a point of view strongly influenced by Adolph Meyer, says, 

|... the acquisition of normal biosocial behavior may be greatly facilitated by 
the organization of a permissive situation, in which the patient has maximal 
opportunity to work through his attitudes and responses overtly in the presence 
of a skilled therapist. .... The immediate goal of treatment in the behavior dis- 
orders is that of establishing a biosocial interrelationship . .. in which patient 
and therapist participate. The uitimate goal is that of making this interrela- 
tionship unnecessary and terminating it with benefit to the patient (pp. 576- 


577). 


Dejerine and Gauckler (8) warn, “If... you have not been able to 
awaken a reciprocal sympathy in your patient, and if you have not 
succeeded in gaining his confidence, it is useless to go any further. The 
result that you will obtain will be worthless....”’ Sullivan (58) 
stresses the concept of parataxis and speaks of the psychiatrist’s 
“participating helpfully in the life of the patient." 

While there may be some important differences among the various 
points of view just touched on, it may be pointed out that there is 
virtually universal agreement among clinicians on the importance of the 
relationship; there is also high agreement on certain of its character- 
istics. 

The most underscored aspect of the therapeutic relationship seems 
to be its warmth, permissiveness, and complete freedom from moralistic 
and judgmental attitudes on the part of the counselor. Far from being 
a coldly objective consideration of the patient's troubles, therapy 
necessarily involves a highly personal form of interaction in which the 
counselor is highly acceptant of the client’s behavior, both overt and 
covert, within clearly defined limits. 

" " 

, E LAM oe eges 2 has become somewhat clouded, and 
n s : v some light on the dynamics of the 
counseling relationship generally. As Sullivan (58) points out, anything 
a patient feels, says, or does constitutes the data of the Berpet E 
enterprise, As the ease with data of any Kind oe’ frat fob 
vite. one vhs aA Si NC dir reject, or judge. Among such 
Koc Ahe theoapist d n es that the counselee may develop 
whatever theoretical uci "m í pee to most clinical workers of 
fail Pinea 1 ation, are intimately related to the success 
Of JAMUEE 0 therapy. Here again an atmosphere free from censure OF 
judgment but pervaded by sympathetic understanding is provided by 
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the c dh 
a oe ee other hand, acceptance does not imply approval 
nie it's celings, attitudes, or overt behavior. This is not sur- 
uo a mosi clinical cases hardly approve of themselves, and their 
Cd sa Pprov al provides one of the most important aspects of the dis- 

v that brings them into therapy. 

lilac beer e DAU odar Dron Tides rir e 
dew uos the orms of human interaction. In the first 
sé a ha ogee y one-sided in the sense that the therapist ordi- 
"nci, ys ie about himself and that the changes effected within the 
e oft 1e relationship are centered in the client rather than being a 
ttual modification. The exchange between counselor and patient 
un does not resemble that between friends in spite of the friendliness 
cue generally permeates the relationship. Secondly, it is sharply 
mited in that the therapist's expressed interest in his client does not 
extend beyond the confines of the clinic. The two do not mingle 
socially, the clinician does not usually intercede for the patient in times 
of Stress, and he generally does not become embroiled in attempts to 
manipulate the patient's environment. The therapist's office is desig- 
nated as a place where one can come in perfect safety, free from threats 
and blame, to “think about” one's problems; but it is not a place where 
dispensations are sold or intercessions granted. Finally, there is a tacit 
agreement between therapist and patient that their connection is to be 


Severed as soon as the patient feels free to go about his business without 
In other words, the interest, acceptance, and 


the counselor's support. 
for the client to make capital of so 


u E 
affection" of the therapist is there 
So long as he wishes it. Unlike non-clinical situations, there is no pres- 


Sure on him to maintain the relationship out of politeness or any of the 
Other social rules that more or less govern intimate relationships in 


Society at large. 
All this may be recapitualted by saying that the methods common 
to the various forms of psychotherapy involve (a) the formation of a 
Special kind of personal relationship and (b) a conversation with the 
patient about his anxieties and the events which tend to produce them. 
.and the physician- 


As Finesinger (10) puts it, “Communication. . 
Patient relation are the tools that must be adapted to the goals of psy- 


chotherapy.” 
The argument thus far, then, runs something like this: The common 
Problem characterizing clinical patients is anxiety and the behavioral 
1 1 of psychotherapy, regardless of 


defenses built up against it. The goa che : 
the therapist’s theoretical leanings, is to eliminate the anxiety and 


thereby to do away with the symptomatic persistent non-integrative 
oal, all therapists use the devices of con- 


behavior, To accomplish this g t the di t con 
Versing with the patient about his anxiety and the situations calling it 
forth both currently and historically, and forming a unique therapeutic 
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relationship. Since all psychotherapies seem to have successes to their 
credit and since psychotherapy seems to be a process whereby a patient 
learns to modify his emotional reactions and his overt behavior, it is 
hypothesized that therapy may be conceptualized from the point of 
view of gencral psychology as a problem in learning theory. Such a 
conceptualization must account for the changes that occur in counselees 
in terms of these factors that are apparently common to all forms of 
counseling. Before attempting such a conceptualization, it is necessary 
briefly to review the situation in learning theory. 


Mayor THEORIES OF LEARNING 


One of the major issues with which learning theorists are concerned 
has to do with the conditions which are necessary if learning is to occur. 
Two points of view have gained the widest currency with respect to 
this question. 

Reinforcement theory. The first is that of Clark Hull (19) Within 
Hull’s system, learning is thought to proceed somewhat in this manner: 
When a motivated organism is subjected to stimulation—from cither or 
both the stimuli associated with the motivating conditions themselves, 
as in hunger or pain, and those acting on it from the external environ- 
ment—it tends to respond in a trial-and-error way. If, in the course of 
its trial-and-error behavior, the organism performs a response which is 
associated with the reduction of motivation, the probability of that 
response's occurring again under similar stimulus conditions is in- 
creased, or—to put it somewhat differently—the connection between 
the present stimuli and the response is strengthened. The central 
emphasis here is on the occurrence of drive reduction or a satisfying 
state of affairs, variously designated as the law of effect or the principle 
of reinforcement. As Miller and Dollard (31) succinctly sum it up: 
learn, an organism must want something (be motivated in some way) 
notice something (be acted upon by stimulus cues from the external or 
internal environment), do something (perform a response Or response 
sequence), and get something (experience a reduction in motivation). 

Contiguity theory. Opposed to a reinforcement theory of learning 1$ 2 
point of view which holds that the basic condition necessary for learning 
is that of contiguity in experience. Tolman and Guthrie are perhaps the 
outstanding proponents of this theory, although they differ markedly ! 
their conceptions of the nature of learning. 

Tolman (61), taking his point of departure essentially from Gestalt- 
theorie, conceives of learning as the acquisition of information or cogni- 
tions about the environment. Variously referred to as "isign-gestalt 
expectations," “sign-significate relations,” and “hypotheses,” these 
cognitions presumably have reference to knowledge which the organist 
acquires tO the effect that a given stimulus or sign, if reacted to 1” 
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out this problem as early as 1929, referring to it as “the dilemma of the 
conditioned defense reaction." He then wrote, 

For a defense reaction to be wholly successful, it should take place so early that 
the organism will completely escape injury, i.e., the impact of the nocuous (un- 
conditioned) stimulus. But in case the unconditioned stimulus fails to impinge 
upon the organism, there will be no reinforcement of the conditioned tendency, 
which means one would expect that experimental extinction will set in at once. 
This will rapidly render the conditioned reflex impotent, which, in turn, will 
expose the organism to the original injury. This will initiate a second cycle sub- 
stantially like the first which will be followed by another and another indefi- 
nitely, à series of successful escapes [from all contact with the noxious stimulus] 
always alternating with a series of injuries. From a biological point of view, 
the picture emerging from the above theoretical considerations is decidedly not 


an attractive one. : : 
There is thus presented a kind of biological dilemma . - - (p. 511). 


In other words, reinforcement theory finds it hard to explain how an 
organism can learn to avoid painful stimulation entirely, because if the 
painful stimuli do not act upon the organism's receptors, no drive is 
aroused to act as à basis for maintaining the defense reaction. 

Mowrer and Lamoreaux (36), concerning themselves with this prob- 
lem, resolved the dilemma by positing a conditioned fear reaction to 
the conditioned stimulus. According to their formulation, the condi- 
tioned stimulus has signal value, signifying to the organism an approach- 
ing danger and arousing in it those anticipations of punishment known 
as the secondary (acquired) drive of fear (anxiety). On the basis of this 
secondary drive, trial-and-error behavior occurs, out of which is differ- 
entiated, according to the principle of reinforcement, a response which 
reduces the fear and permits the organism to avoid or to minimize the 
painful unconditioned stimulus. 

Such a resolution of the dilemma of the conditioned defense reaction, 
however, gives rise to another difficulty of comparable magnitude: How 
is the fear learned? If one holds to a thoroughly monistic reinforcement 
position, he is forced to say that the drive state of fear or anxiety is 
somehow "satisfying" or motivation reducing. Baldly, the reinforce- 
ment theorist is forced to hold that secondary drive arousal occurs ON 
the basis of drive reduction. That this is certainly contrary to any kind 
of common sense consideration is immediately apparent, and it is dif- 
cult to see how an exchange of one drive for another—the situation 
which would obtain were the law of effect rigidly adhered to— could be 
of any biological benefit. This is particularly true when one recalls that 
many fears, especially neurotic anxiety, are much more debilitating 

than the objective conditions which generate them— witness the many 
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Razran (43) in 1939 offered a somewhat similar formulation, classify- 

ing learning according to what he called "quantitative" and ‘‘qualita- 

tive" conditioning, corresponding to learning without reinforcement 
and law-of-cffect learning. He reports no evidence for the so-called 
qualitative conditioning of autonomic reactions, but does not say 
explicitly that quantitative conditioning applies exclusively to the 
acquisition of viscero-vascular reactions. He docs raise the issue of the 
differential importance of two events, the application or onset of the 

f unconditioned stimulus and the termination of the unconditioned 
stimulus, for the conceptualization of types of learning. 

More recently, Mowrer (34) has vigorously exploited the idea ofa 
two-factor theory of learning to account not only for the learning © 
skeletal muscle responses but for the acquisition of secondary drives 
like fear and anxiety. He fully accepts the notion that striped musele 
acts, mediated by the central nervous system, are learned, according tO 
the principle of reinforcement, by virtue of their association with the 
termination of the noxious stimulation identifed as motivational states. 
This is not only fully in keeping with Hull’s position but is quite in line 
with Mowrer's own previous enthusiastic experimentation and theoriz- 
ing as a monistic member of the reinforcement school (33). His new 
point of view, however, holds that smooth muscle and glandular "emo- 
tional" reactions, autonomically mediated, are acquired through their 
association with the onset of the paired unconditioned stimulus of pain 
and conditioned stimulus or signal. In other words, fear refers to the 
viscero-vascular components of the pain response, conditioned to 2 
substitute stimulus through the latter's contiguity with the onset of the 
action of a noxious adequate stimulus. He prefers to restrict the term 
conditioning to the learning of “emotional” reactions by contiguity and 
to use problem-solving to designate the learning of skeletal responses 
which “solve” the "problems" created by drives and which are acquired 
according to the reinforcement principle. 

One of the points which must be made immediately with respect to 
two-factor theories such as these is aimed at the scotching of the criti- 
cisci ten fend OT leveled against attempts to account for learning 
re be pr (nec Such attempts frequently permit the 
Toten Ibe are aver notion happens most easily to explain his 

; he can explain every thing but predict nothing. With the possible 
exception of Razran's, the two-factor formulations just reviewed are not 
liable to such an attack. While two principles are postulated, con- 
tiguity and reinforcement, two learning processes, one involving the 
viscero-vascular system and the other the skeletal muscular system, are 
also suggested. The principle that governs one process may not be in- 
voked to explain what occurs in the other. For either process, tHe 
theory is monistic and parsimonious and presumably subject to an 
experimentem crucis.? 3 

Direct experimental tests of the two-factor theory are as yet few. 
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difference in the curves of response acquisition between a group of rats 
trained under conditions where the conditioned stimulus overlapped 
and terminated with the turning off of the unconditioned stimulus of 
shock and a group of animals where the conditioned stimulus was 
turned off at the time of the unconditioned stimulus's onset. 

The interpretation of these results is that the animals learned to 
fear the conditioned stimulus by virtue of its contiguity with the onset 
of pain. This anticipation of pain gave rise to trial-and-error behavior 
out of which was differentiated the running response, which was rein- 
forced by fear reduction or the avoidance of pain. The acquisition of the 
fear reaction was not furthered, as reinforcement theory would predict, 
by having the warning signal overlap and end in contiguity with the 
reinforcing state of affairs provided by the termination of the shock. 

The more crucial experiment, yet to be done, would involve the 
testing of the hypothesis that some autonomically mediated reaction, 
taken as an index of fear, will be attached to some conditioned stimulus 
by virtue of its association by contiguity with the onset of noxious 
Stimulation, whereas it will not become attached any more effectively 
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shock to escape from a white compartment with a grid floor through an 
open door into a black compartment without a grid. Subsequently, the 
animals, without shock or noxious stimulation of any kind, learned a 
new habit—rotating a little wheel to open the door, which had been 
closed, in order to escape from the white compartment to the black one. 
This was interpreted to mean that the secondary drive of fear had been 
acquired and that its termination could be used as reinforcement for 
striped muscle responses. In terms of the two-factor formulation, the 
rats learned to run into the black box by virtue of the reinforcement 
provided by pain reduction. At the same time, however, fear or the 
visceral component of pain became conditioned to the cues of "white- 
ness and grid floor” associated with the onset of shock. The conditioned 
fear then served as the drive on the basis of which the wheel-rotating 
habit was learned without benefit of further primary drive arousal 
through shock. 

It would seem, then, that in spite of its present tentative status, à 
two-factor theory of learning—holding that adaptive, striped muscle 
habits are built up according to the principle of reinforcement whereas 
anticipatory, “emotional” reactions, probably viscero-vascular in 
nature and having drive properties, are acquired according to the 
principle of contiguity—has the greatest explanatory and predictive 
power at the moment. 


LEARNING THEORY AND PSYCHOTHERAPY 


How can such a conception of learning be applied to psychotherapy 
to cover the elements of the psychotherapeutic process common to all 
forms of counseling? It will be recalled that the problem of therapy is 
essentially that of somehow ridding the patient of neurotic anxiety, 
which supports his persistent non-integrative defenses and accounts in 
large measure for his "unhappiness." The tools used by all therapists to 
accomplish this job are those of conversational content and the thera- 
peutic relationship. 


Therapy as the acquisition of symbolic controls. Shaffer (49) suggests 
that psychotherapy be conceptualized in terms of the patient's acquisi- 
tion of language symbols by which he can more effectively control his 
non-integrative behavior. The rationale of this approach is based on the 
observation than an outstanding characteristic of the maladjusted is 
their inability to control their own acts; in their own terms, “I know 
should (or shouldn’t) do this, but I just can't (or must)." Since "nor- 
mal" people seem to control their behavior by means of symbols 
including subvocal and gestural symbols—Shaffer’s notion seems at first 
blush to follow readily. 

Such an idea is also more or less explicit in Shaw’s (50) analysis of 


repression and insight. He argues from Mowrer and Ullman's (38) poi? 


that, 
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The common denominator in all . . . forms of non-integrative behavior seems 
to be the inability to use symbols appropriately as a means of bringing remote 
as well as immediate consequences into the present in such a manner that they 
may exert an influence proportional to their objective importance (p. 81). 


Shaw moves from here to the contention that therapy is a process by 
which non-integrative behavior is eliminated by the making available of 
symbols, holding that the symbols become cues for the more remote 
punishing consequences of neurotic defenses. 

It is not quite clear, however, according to either Shaffer or Shaw, 
what the symbolization at which therapy aims might be. If it is the 
symbolizing of acts which have been repressed, there is no indication 
of how such a procedure would accomplish anything more than the 
release of a flood of anxiety heretofore held in check—albeit imperfectly 
—by the repression mechanism. On the other hand, if the symbols 
made available by therapy amount only to accurate predictions of the 
consequences of the client’s non-integrative behavior, their utility is 
questionable on several grounds: First, most clinical patients are only 
too sharply aware of the self-defeating nature of their activity; their 
complaint is that they don’t know why they engage in it and at the same 
time seem unable to avoid it. Second, some cases (especially those who 
have been formally psychoanalyzed) demonstrate a remarkable glibness 
—sometimes quite accurate—about their own defenses and yet eis 
anxiety ridden on the one hand and socially somewhat obnoxious on the 
other. It is probably these instances which gave rise to H. M. Johnson's 
(20) rather oversevere recent strictures on psychoanalysis as bee 
and as rationale. Third, there is a question as to whether or ao y 
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A somewhat different suggestion, here proposed, is this: If neurotic 
anxiety is produced by the repression of some unextinguished response, 
it should follow that the anxiety can be dissipated in one of two ways— 
either by the elicitation of unreinforced occurrences of the response, 
thus leading to extinction, or by the connecting of a different affect to 
response tendencies which have undergone repression. With respect to 
the illustrative case mentioned above, anxiety could be dispelled either 
through eliciting self-initiated behavior and failing to reinforce it until 
extinction occurred, or through forming a bond between the tendencies 
to self-initiated behavior and some non-anxious visceral reaction which 
will supplant the connection between anxiety and the repressed activity, 
In either case, the Shaw-Shaffer notion holds as the first step in therapy, 
the bringing into communicability (consciousness) of the tendency that 
has undergone repression. 

This lifting of repression is what is usually known as insight. When 
the patient is able to verbalize the repressed tendencies fundamentally 
associated with his anxiety, he ''sees" or demonstrates insight. It is 
difficult to understand, however, why this should be equated with cure, 
regardless of how important it is as a step toward psychological re- 
covery. Merely being able to talk about the cues for anxiety does not 
make them any less terrifying. Extinction or counter-conditioning is 
still necessary. 

Whether the extinction or the counter-conditioning technique is 
preferable depends in part on the desirability of the repressed behavior. 
In the case of self-initiated activity, the question seems rather clear. 
Socialization has been defined (35) as the process of developing from a 
dependent infant into an independent and dependable adult. The ex- 
tinction of tendencies toward self-initiated "responsible" behavior would 
mean the continuation of dependence and infantilism. It seems prob- 
able that few clinicians would look upon this as a suitable therapeutic 
goal. The same thing might well be said of most of the impulses which 
typically undergo repression, sexuality being a case in point. The frigid 
wife, raised under conditions of puritanical restrictiveness, might well 
find some immediate relief from anxiety by having her repressed sexual 
impulses extinguished (if this is possible); but it is doubtful that such a 
procedure would be helpful in her marriage. 

The counter-conditioning hypothesis. The hypothesis of counter- 
conditioning is suggested as somewhat more tenable. It involves the 
following set of notions: The conversational content aspect of counseling 
consists in the symbolic reinstatement of the stimuli which produce and 
have produced the patient's anxiety. Through his words to the thera- 
pist, the client, on a symbolic level, again “lives through'' the stimulus 
situations which were painful to him, in which he underwent punish- 
ment, and which initiated the repression sequence. This constitutes the 
lifting of repression, the introduction into communicability of the re 
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pressed tendencies, the development of insight. This proceeds essential- 
ly by the therapist's reinforcing by his acceptance and his sympathetic 
participation of the patient's self-revelatory behavior. At the same time, 
the discussion of the client's anxiety is being carried on within the 
context of the unique patient-therapist relationship. This is conceived 
as an unconditioned stimulus for feelings of pleasure, acceptance, 
Security- —non-anxious affective reactions. The therapeutic process con- 
Sists in the establishment of a bond between the symbolically reproduced 
Stimuli which evoke and have evoked anxiety—chiefly the cues as- 
sociated with the incipient movements toward performing some re- 
Pressed activity—and the non-anxiety, i.e., comfort and confidence, 
reactions made to the counseling relationship. 

Such a formulation goes somewhat beyond the bounds of "emo- 
tional" learning as accounted for by the two-factor theories briefly dis- 
cussed above. They are chiefly concerned with the learning of fear or 
anxiety, basic secondary drives. While the idea presented here may be 
an extension of the theory that its protagonists would find unacceptable, 


there seems to be no reason why the principle of contiguity should not 
apply to viscero-vascular reactions that are “pleasant” as well as to 
those which are "unpleasant"; as a matter of fact, such an application 
Seems to be demanded if the learning of affects is governed by a single 
Principle. The conceptualization proceeds in this wise: Affects possess- 
ing drive value—fcar, anxiety, and anger'—are learned by virtue of the 
association by contiguity of the visceral aspects of some primary drive 
With concurrent external stimuli. The so-called "positive" or ‘pleasur- 
able” affects are learned by virtue of the association by contiguity of 
Proprioceptive cues set up at the onset of drive reduction with con- 
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in experience are those correlated with drive reduction; "unsatisfying 
events in experience are those correlated with drive onset.? 


To return to the counter-conditioning hypothesis in psychotherapy, à 
rather striking analogy may be pointed out between this formulation and the 
now famous experiment of Mary Cover Jones (21) with the boy Peter. It will 
be remembered that Peter was a three-year-old with a number of acquired fears 
of various objects, including small white furry animals. In an effort to eliminate 
these fears, Dr. Jones attempted a counter-conditioning procedure. At lunch 
time, just as the child began to eat a meal which included his favorite dishes, a 
white rabbit was introduced in a wire cage at the end of the room, far enough 
away not to disturb the boy's eating. Each day the animal was brought a little 
closer until finally Peter could eat with one hand while stroking the rabbit with 
the other. Further tests showed that the newly conditioned “comfort” reaction 
to the rabbit had generalized to a large number of other, formerly fear-evoking 
stimuli such as rats, frogs, cotton, and fur rugs. 

The meaning of these results is that a new connection was formed between 
the stimuli (rabbit) which produced a fear reaction and the comfort reaction 
made to the stimulus of the lunch with all its various cues. The necessary condi- 
tion for the formation of this new connection was contiguity of the noxious stim- 
ulus and the comfort reaction aroused by the unconditioned luncheon stimulus 
situation. The problem of how to pair the stimuli so that those connected with 
the meal did not come to evoke fear does not affect the fundamental point of 
contiguity as the basis for the establishment of the new bond, but is merely a 

matter of the spatial and temporal patterning of stimuli common to most ex- 
perimentation under the conditions of classical conditioning. 


The main objection to this analogy probably rests on the point that 
Peter was troubled by a fear rather than an anxiety—that is, an affec- 
tive reaction, uncomplicated by repression, made to external stimuli 
rather than to some impulse to behave in a tabooed way. The objection 
is certainly granted and actually implies the basis for the first step in 
therapy, the uncovering by use of the conversational content of thera- 
peutic interviews of the repressed impulses. Defore counter-condition- 
ing can occur, the stimuli connected with anxiety must be brought into 
communicability, where they can be symbolically reinstated at the ap- 
propriate times. Insight is a prior condition of counter-conditioning. 

A second objection that can be raised to the counter-conditioning 
notion is this: If therapy is simply a matter of connecting anxiety- 
provoking stimuli with some comfort reaction, why is it not therapeuti- 
cally effective to think of one's troubles while lying in a comfortably 
warm tub?? There seem to be three answers to this. First, to a degree it 
is effective. The widespread method of combatting the "blues" by 


7 [t is interesting to speculate as to whether or not this is the mechanism underlying 
the acquisition of aesthetic tastes, preferences, and other “likes” and “dislikes.” The 
implications fora psychological approach to valuative behavior are obvious. " 

8 This point was raised in a very helpful personal communication from Dr. John F- 
Seward. The replies offered to the objection, however, are not chargeable to him. 


PSYCHOTHERAPY AS A PROBLEM IN LEARNING THEORY 387 


means of a shower is directly in point, as is the use of continuous baths 
and warm packs in mental hospitals. The real problem is: Why is such 
à procedure less effective than psychotherapy? This gives rise to the 
second answer, which is that thinking of one's troubles while lying in a 
comfortably warm tub is usually of little help in creating insight, 
symbolically re-introducing the relevant anxiety-producing stimuli. The 
bath is of little assistance in bringing forbidden impulses into com- 
municability, hence the ‘therapeutic effects" of the bath are of short 
duration. The third reply to such an objection is based on the fact that 
neurotic anxiety is primarily social in its inception. Sullivan (59) insists 
that this "interpersonal induction of anxiety, and the exclusively inter- 
personal origin of every instance of its manifestations, is the unique 
characteristic of anxiety and of the congeries of more complex tensions 
-.. to which it contributes." This squares perfectly, of course, with the 
concept of repression and the role it plays in anxiety theory. If neurotic 
anxiety is an anticipation of punishment for the performance of some 
tabooed act, it follows that the taboo must have been laid down and 
enforced through some kind of social medium. Consequently, one 
would expect in the light of such social origins that the elimination of 
anxiety would be facilitated by the presence of certain social factors in 
therapy—provided in this case by the patient-therapist re pesa sca 
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matter of determining the categories of counselor response that most 
effectively further the bringing into communicability of repressed im- 
pulses. On somewhat dangerous 4 priori grounds it would seem that 
interpretation, probing and other more active procedures would be use- 
ful unless introduced too preemptorily or too early into therapy, thereby 
destroying the patient-therapist relationship. That this occurs is not 
denied, but to attack such techniques as being of no value because they 
are sometimes misused seems somewhat absurd. The situation is 
analogous to bringing the rabbit too far into Peter's lunch room too 
early and connecting the fear reaction to the animal to the stimulus 
complex of food, room, high chair, and so forth. It seems somewhat 
nonsensical to argue that the baby should be thrown out with the bath 
water simply because it is still a bit grimy. One wonders if Peter would 
have overcome his fear of rabbits had he only been thoroughly "ac- 
cepted” without ever having any help in reencountering the noxious 
stimulus in à secure and “pleasant” situation. 

The directive-non-directive controversy may well reduce to a con- 
sideration of the types of case for which each is best suited. It can be 
hypothesized that more non-directive approaches will be more likely to 
succeed with those clients who have few and relatively unsevere re- 

ressions, some insight into the sources of their anxiety, and a capacity 
to relate easily to the therapist. These are cases which do not require 
much help in discovering the anxiety-producing stimuli; they do need 
assurance from a counselor that they may talk about them in his pres- 
ence with complete impunity. Conversely, more interpretative methods 
by hypothesis will be of greater effectiveness with cases characterized by 
higher defenses, greater repression, and less initial insight. It must be 
emphasized, however, that all this is a matter of the empirical deter- 
mination of what techniques work best for given cases so far as the 
lifting of repressions is concerned. The hypothesis of counter-condition- 
ing is still the means of explaining the diminution of anxiety after in- 
sight has been developed. 

If this formulation is correct, how can various failures of counter- 
conditioning methods in psychological treatment be answered? Voegt- 
lin's (63) work with alcoholics is typical. This clinician attempted to 
cure his patients of drinking by having them take whiskey so heavily 
dosed with a powerful emetic that vomiting to the point of pain was 
immediately induced. Results were disappointing. Most of his cases 
did not build up more than momentary conditioned aversions tO 
alcohol. Of those few who became conditioned against liquor over à 
period of time, several showed symptom substitutions, e.g; the develop- 
ment of psychosomatic symptoms or neurotic syndromes instead © 
alcohol addiction. ! 

The first objection to such a procedure is that it consists in à direct 

attack on the symptomatic mechanism rather than on the underlying 
anxiety: If the anxiety reduction occurring from drinking were greate 
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than the pai 
the pain of the treatment, the treatment would have very nearly as 


acm ui me strongly advising the patient “to get on the wagon." The 
tha "ele pe hangovers is relevant in this connection. Second, if 
tomen He diction were wiped out by virtue of the conditioning 
TER HA z a anxiety would be unaffected, and one would 
rid ace one icd bina the patient would develop some other persistently 
eee ue seen bod of reducing 1t. Third, the treatment situation 
fran hart m clements of attempting to eliminate a response by 
Foi bu ig its The inefficacy of such methods has already been 

ssed. Thus, an objection based on such therapeutic experience fails 


to carry much weight. 


Reeducation in psychotherapy. Does the point of view developed here 


rbd m notion in the therapeutic armamentarium? On the con- 
ae a ge includes it as an important third aspect of counseling, 
ser d m: the lifting of repression and the counter-conditioning of 
hewn T i “ollowing the development of insight, as anxiety is dissipated 
dation 1 conditioning, the patient typically begins to plan. His first ten- 
1 der Qi in this direction may take the form of asking, What shall 
d rit may be a more vigorous exploration of the possible conse- 
quences of projected steps. Here the therapist may be of assistance in 
helping his client to formulate goals clearly and to consider realistically 
the various behavioral methods he might employ to reach them. This 
constitutes a law-of-effect learning situation in which reinforcement is 
produced through the patient’s own verbal self-approval or self-dis- 
Sperone, based in part on the predictions of consequences which the 
le can help him arrive at. In a sense this constitutes the "ra- 
a a exercise of symbolically mediated self-control of which Shaw 
ect haffer may be speaking. It is rational insofar as the behavior se- 
ed is founded on some consideration of its probable remote outcomes 
rather than on its immediate value as an anxiety-reducing agent, and 
It is “responsible” insofar as it is chosen? in terms of the patient's own 
values as of the moment of choice. The counselor does not direct; he 
merely helps the client work out relatively accurate estimates of the 
Consequences. If a particular behavior pattern is rejected, it merely un- 
dergoes a voluntary suppression Or is extinguished through failure o 
reinforcement without being forced into incommunicability and becom- 
ea a stimulus for anxiety, as is the case in the repression of punished 
Sordengies: Through this symbolic trial and error, then, the patient ak 
i according to the principle of reinforcement, a tentative p 
Shin al behavior based on rational considerations to supplant us 
Meg pattern of persistent non-integrative behavior based on the 1m- 
late necessity of reducing anxiety regardless of the ultimate cost. 
free will, reference is 


o heavily with ere 
1a deterministic phi- 


9 
Lest the language used here seem flavored to Wet 
f choice withir 


mad 
lies to Hall's (16) paper, in which the problem © 
phy is discussed. ' 
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SUMMARY 


A learning theory interpretation of psychotherapy must take into 
account (a) the fact that all forms of psychotherapy are able to claim 
cures, (b) the similarity of clinical cases in terms of neurotic anxiety and 
its defenses, (c) the common goal of psychotherapies of the diminution 
of anxiety, and (d) the fact that all clinicians employ as their chief 
techniques conversational content and the therapeutic relationship. 

It is here proposed that psychotherapy occurs through three inter- 
related processes: first, the lifting of repression and development of in- 
sight through the symbolic reinstating of the stimuli for anxiety; second, 
the diminution of anxiety by counter-conditioning through the attach- 
ment of the stimuli for anxiety to the comfort reaction made to the 
therapeutic relationship; and third, the process of reeducation through 
the therapist’s helping the patient to formulate rational goals and be- 
havioral methods for attaining them. 

Such a scheme seems to harmonize most eflectively with a two- 
factor learning theory of the type most recently developed by Mowrer 
(34). Such a theory conceives of skeletal muscle responses as being 
acquired through the principle of reinforcement, whereas viscero-vascu- 
lar, "emotional" reactions are acquired according to the principle of 


contiguity. 


This formulation is certainly not to be regarded as anything final. It 


leans rather too much on plausible but inadequately tested hypotheses 
and on scientifically tenuous analogies. It is offered only as a prelimi- 
nary attempt to effect a rapprochement between psychotherapy and 
general psychology, and to organize some of the phenomena of clinical 
practice within the framework of systematic behavior theory. 
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STATISTICAL METHODS APPLIED TO RORSCHACH 
SCORES: A REVIEW! 


LEE J. CRONBACH 
Bureau of Research and Service, University of Illinois 


p While the Rorschach test grew out of clinical investigations, and is 
still primarily a method of individual diagnosis, there is increasing 
emphasis on statistical studies of groups of cases. On the whole, the 
statistical methods employed have been conventional, even though the 
Rorschach test departs in many ways from usual test methodology. The 
present review proposes to examine the methods which have been em- 
ployed to deal with Rorschach data, and to evaluate the adequacy of 
those often used. It attempts to provide a guide to future investigations 


by indicating statistically-correct studies which can serve as models. 
; the generalizations about the test 


There is no intent here to review 
arising from these studies, or to call into question general research pro- 


cedures, sampling, and other aspects of the studies. 
This report may be considered an extension of a review by Munroe 


(41). In 1945, she considered the objectivity of previous Rorschach re- 
search. She distinguished between the goals attainable by clinical in- 
tuitive interpretation and the goals to be reached by more quantitative 
Procedures. She traced the trend in Rorschach literature, noting the 
gradual decrease in studies based solely on impressionistic treatment of 
data or on mere counting of scores, and the introduction of significance 
tests, standard deviations; and other signs of adequate effort to test 
generalizations statistically. She also pointed out some errors in sta- 
tistical thinking that lead to faulty conclusions about the Rorschach 
test. Munroe takes the position, and the writer fully concurs, that sta- 
tistical research on the Rorschach test is not only justifiable, but indis- 
Pensable, The flexibility of clinical thinking creates excellent hypotheses, 
but these hypotheses can only be established as ‘true by controlled 
Studies, Among the propositions suggested by clinical work, some are 
certainly untrue, due to faulty observation, inadequate sampling, and 
errors of thinking. Statistical controls are essential to verify theories of 
test interpretation, and to validate proposed applications of the test. 
Even though the clinician studying one person makes no use of statis- 
oS he employs generalizations about the test which must rest on id 
tifically-gathered evidence. Munroe demonstrated that the Rorsc ae 

test lends itself to objective studies; the writer reviews the same ma- 


rederick Mosteller and to N. L. Gage, 
for its improvement. 


393 


ation to F 


H H 
The writer wishes to express apprec! : 
d many suggestions 


Who 
n : : 
ead this manuscript and contribute 


394 LEE J. CRONBACH 


terial more technically to evaluate the soundness of the statistical pro- 
cedures on which the conclusions are based. 


CLINICAL TREATMENTS OF DATA 


While this paper deals principally with statistical methods applied 
to raw Rorschach data, we shall consider briefly the statistical pro- 
cedures used when clinically interpreted case records are used in a study. 
The Rorschach record is usually interpreted qualitatively and in a 
highly complex manner when the test is given in the clinic, and many 
studies have been based on these interpreted records. In only a few 
studies of this type do statistical problems arise. 


Dichotomized Rorschach ratings. In one type of study, the interpreter 
of the records makes a final summary judgment, dividing the records 
into such groups as "adjusted-maladjusted" or “promising-unpromis- 
ing,” etc. This method is most used for validation studies, where the 
Rorschach judgment is compared with a criterion of performance or 
with a judgment from some other test. Simple statistical tests suffice to 
test the degree of relationship. If the criterion is expressed in two cate- 
gories (as when the criterion indicates success Or failure for each case), 
chi-square is simple and appropriate. This is exemplified in a study of 
success of Canadian Army officers (51), where a prediction from the 
Rorschach is compared with a later rating of success and failure. If the 
criterion is a set of scores on a continuous scale, bi-serial r is usually an 
adequate procedure. In bi-serial 7, one assumes that the dichotomy rep- 
resents a continuous trait which is normally distributed. This assump- 
tion is generally acceptable for personality traits and for ratings of 
success. 

Rorschach ratings on continuous scale. In some studies, the Rorschach 
interpretation is reported in the form of a rating along a scale, rather 
than as a dichotomy. When the criterion is dichotomous, bi-serial 7 is 
appropriate. (E.g. a prediction of probable pilot success is so corre- 
ie wth dimittam from raining, 21 p; 032), rac 

, like grade-average, product-moment is convention- 
ally used. 
of he qu cale n papi cie: sequ because of a limitation 
equal, the correlation may not indic d pos nos psychologically 
If ratings are careful, one can assu zm ae eje tos velecidnslr o, 
rior to men rated “Fai ” and th spem a vr n edi bt x d 
tomm ig Eu in 5 at men rated "Excellent aresuperior to 
A od! to A deg y be unwise to assume that the jump from 
xcellent" is equal to the jump from "Fair" to “Good, 
as one automatically does in correlating. One solution to this difficulty 
is to assume that the trait rated is normally distributed in the men stud- 
ied. Then we can condense the five-point scale into a dichotomy, whic 
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pu M aed in „the preceding paragraph. Alternatively, one 
E mn the ratings into scaled values which will yield a normal dis- 
moüs n 4). Bi-serial ris then appropriate, if the criterion is dichoto- 
predi £ imilar reasoning applies to the correlation of a rating with a 
Canis ru E cd will obtain the most meaningful results by 
en mizing the rating and using bi-serial r, or by normalizing before 

g product-moment r. These suggestions are summarized in Table I. 


TABLE I 


PREFERRED METHODS FOR COMPARING RORSCHACH INTERPRETATIONS WITH 
CRITERIA OF VARIOUS TYPES 


Judgment made from Rorschach 


TUS 1 


Continuous scale, 


Criterion 
Di f 
ichotomy unequal units 


x2 after dichotomizing rat- 
ing; rvis after normalizing 
GER rating* 
ontinuous scale, unequal | x? xt after dichotomizing both 
units terion; fbis variables; rvis after nor- 
izing criterion* malizing one, dichotomiz- 
ing the other; product- 
moment 7 after normaliz- 
önti ing both a 
inuous scale, equal Pin” ryis after dichotomizing rat- 
units ing; product-moment r 
after normalizing rating 


Dicho 
tomy | x? 


after dichotomizing cri- 
after normal- 


* * 5 
Point bi-serial must be used if the two parts of the dichotomy cannot reasonably 


be . : 
Considered subdivisions of a continuous scale. 


Rorschach adjustment rating with suc- 
both variables were reported on a four- 


Category scale, used a coefficient of contingency. Where the correlation 
Surface is nearly normal, this coefficient with proper corrections should 
Sive approximately the same result as the product-moment 7 for nor- 
malized data, corrected for broad categories. Yates (70) has recently 
Offered an alternative method of adapting the contingency method to 
take advantage of trends in the relationship between variables expressed 


aS ordered categories. . . 
Matching methods. Another favorite technique for evaluating Ror- 
Schach results is blind-matching, which permits a study of each case 
as a whole," When a set of Rorschach records (interpreted or not) and 
another set of data regarding the same individuals are available, one 
may request judges to match the two sets in pairs. The success of 


Pe Munroe (42), comparing à 
Ss in academic work, where 
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matching is evaluated by a formula developed by Vernon (66). An ex- 
ample of its use is a study by Troup (62), in which judges tried to match 
two Rorschach records for each person. One hundred fourteen matches 
were correct out of a possible 120, judges considering five pairs at a time. 
By the Vernon formula, this corresponds to a contingency coefficient of 
88. A coefficient of .40 was obtained when judges attempted to match 
the record of each case with that of his identical twin. Another excel- 
lent illustration of the method is provided by J. I. Krugman (31), who 
used it to establish that different evaluations of the same Rorschach 
protocol could be matched, and that the interpretations could be 
matched to the raw records and to criteria based on a case-study. 

The limitations of this method are not statistical; they lie more in 
the human limitations of judges. A portrait based on the Rorschach 
may be nearly right, yet be mismatched because of minor false clements. 
Matching, on the other hand, might be excellent, even perfect ; the study 
would still not guarantee that each element in each portrait was correct, 
especially if the subjects were quite different from each other. In fact; 
the portrait might be seriously wrong in some respects, without prevent- 
ing matching. 

A complex modification of the blind-matching method has been 
proposed and tried by Cronbach (9). Judges are asked to decide whe- 
ther each statement on a list fits or does not fit a case described in a cri- 
terion sketch. Since only about one-third of the statements in the list 
were actually made about the given case, one can test by chi-square 
whether the matching is better than chance. The method yields many 
interesting types of information: (a) an all-over estimate of the validity 
of predictions with relation to the criterion, (b) a separate estimate of 
the validity of the description for each case or for subgroups, and (c) 
an estimate of the validity of statements dealing with any one aspect 
of personality (e.g., social relations). 


EnRORS IN STATISTICAL STUDIES 


The majority of statistical studies with the Rorschach test have 
treated Rorschach scores directly, with clinical judgment eliminated. 
This is an important type of investigation, which presents numerous 
problems. Before considering general questions of procedure however: 
it is necessary to deal with several errors and unsound practices found 
in the literature reviewed. These miscellaneous errors must be pointed 
out lest they be copied by later investigators, and to suggest that the 
studies in which the errors occurred need to be reevaluated. 


Significance tests for small samples. The critical ratio is not entirely 
satisfactory when applied to small samples. When there are fewer than 


30 cases per group, the / test 1s preferable. This would apply, for exam- 


n————————————— 
LU ———————— à — 
“ah: =-————_—_—_—_—_), = 


ST: $ 
TATISTICAL METHODS APPLIED TO RORSCHACH SCORES 397 


ple, in G EE Ni] : 
^" hs eren oe comparison of obsessionals with supposedly nor- 
is Coena c ; lis significance ratios are a bit too high, since he used 
at cani bs I. /oaitt, with groups of 20 cases. (It may be noted also 
pee ae m is study does not permit sound generalizations about ob- 
sum M Ey camem to other adolescents. The obsessionals had a 
ey he oc compared to 97 for the normals, so that differences be- 
e groups may be due to intelligence rather than obsessional 


trends.) 
is p generally useful for sm 
Becia setae when the number of cases is below 50. This is es- 
lite oe d per tant when the expected frequency in any cell of a 2X2 
les e ple = a lower, under the null hypothesis. Many Rorschach stud- 
inse e; vi the need for corrections, Kaback's (29, pp. 24, 38-39) 
ni dio à ikang example. She compares the distribution of such a score 
in each of two groups. To do so, she makes the distribution in a 


great ip 3 i 
at number of intervals, with only a few cases per interval, and tests 
In such a case, with 


n rend of the distributions by chi-square. 
Nor Aye cell frequencies, no significant result could be expected. 
distribuan useful to inquire, as her procedure does, whether. the precise 
riii DIS of M scores is the same for the two groups (in her case, 
n ene and accountants.) Her major question was whether one 
mia: wate M more than the other, and this could be answered by dichot- 
trees the distribution and then applying chi-square, with proper 
UP ime In applying chi-square to the 2x2 tables, one should asa 
of ira $ pi actice apply Yates' correction (56, p. 169). The importance 
rae ge will be demonstrated in Table IV. Where groups are 
ginal t Si aer it is best to make cuts toward the center, SO that mar- 
Bienes, s will remain reasonably large. Special problems in the ap- 
probl on of chi-square to successive tests of the same hypothesis, and to 
lems of goodness of fit, are discussed by Cochran (6). 
, Tests for significance of difference in proportions. Throughout the 
Rorschach literature, the formula for the significance of differences be- 
The resulting inaccuracy is slight in most 
and even 


twee s 

um proportions is misused. 

E ems, fortunately. This e n other work, 

E ne statistics books appear to endor: procedure. The usual 
Tmula, 


all samples, but it is important 


rror is common i 
se the faulty 


24 24 
p-p 7 a d 
N nz 
the proportions obtained in the two 


ma . " 
Y not be entered with pi and P» 
te po for p, where 


s A 

amples. Instead, one should substitu 
Nip: + Nds 
GAPUE ea 


po = yy, + Ns 
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A significance test inquires whether pi and p: might arise by chance in 
sampling from a homogenous population in which the true proportion 
is po (see 35, pP- 126-129). Employing pı and ps, instead of entering Po 
in both terms, almost always increases the critical ratio over what it 
should be. Because no correct model is found in the Rorschach litera- 
ture, the following example is given using Hertz’ data (25). 


Five boys out of 41, and 0 girls out of 35 gave zero color responses. 


5+0 
po = = 066 
76 
D66 x 934 . .066 X .934 " 
5. d.aitt. = T = .057 
51 35 
diff. 122 
A= 11225 f» = .00; Eur a 2.14 (P = .032) 
s.d.aitt. .057 


This compares to the critical ratio of 2.41 (P =.016) computed by the formula 
Hertz and other workers have inadvisedly used. 


The above computation is equivalent to the determination of sig- 
nificance by chi-square, and yields an identical result. But in this in- 
stance the expected frequencies are so low that the correction for con- 
tinuity becomes important. Applying Yates’ correction, we find that P 
becomes .10, and the reported difference is not significant. 

Several studies use the formula for proportions in independent sam- 
ples when the formula for paired samples should be used. Thus Hertz 
(25), to compare the 12-year-old and 15-year-old records of the same 
cases, should use a formula for correlated samples as given by Peatman 
(44, p. 407) or by McNemar (37; sce also 13, 59). The correct formula 
would have yielded significant differences where Hertz found none. 
Other studies employing matched samples, where the significance O 
differences was underestimated by a formula for independent groups 
are those of Hertzman and Margulies (27), Meltzer (39), M. Krugman 
(32), Richardson (48), and Goldfarb (20). In studies where the subjects 


were children varying widely in age, the proper formula would probably 
have yielded quite different results. 


A study by Brown (4) committed this error and one even more serious. He 
compared records of 22 subjects without morphine and then with morphine: 
He found that 14 increased in Rand 7 decreased. He then treated these 25 in- 
dependent proportions of the 22 subjects, computing the critical ratio for the 
difference 6476 minus 32%. These are not proportions in independent samP es, 
and Brown's statistical tests are meaningless. No manipulation of the increase 
decrease frequencies 1s as satisfactory for this problem as the formula given 
McNemar. Brown could properly have set a cutting score (e.g., 20R) and co?" 
pared the percentage exceeding this level with and without morphine. " 

Siegel's procedure (55), in which the “percentage incidence" of a factor in 


ST: 
{TISTICAL METHODS APPLIED TO RORSCHACH SCORES 399 


one gro TAP. oe inel H 
group is divided by the incidence in the second group, will be likely to pro: 


-— misleading results. 
An alte ivi igni 
Iternative formula for the significance of differences in matched groups 


IS us > Cr E 

hes bees (18). In applying the formula, however, a serious error was 
e formula given by Engelhart which Gann adopted is 

m cain? = (oan? — om (1 — ri’) 

y is the correlation of the matching variables with the variable in which a dif- 
This formula may be extended to differences in propor- 
d tion value (5o) for the proportion sho 
á bere gend He and Mo, as explained above. Gann’s po Ie was aiii 
à malene 7 3 Or fiy in all her calculations. From the context, this seems to be 
The ae correlation of all matching variables with all dependent variables. 
Bases € 3 procedure, for any single significance test such as the proportion of 
‘Sader = swag Mt would be to correlate the matching variables with W- 
the ek Que This correlation would almost certainly be close to zero. By 
shoudl ts i Gann used, the critical ratios are very much larger than they 
establa . In one comparison where Gann reported a CR of 6.02 the writer has 

shed that the true CR cannot be greater than 2.23, and is almost cer- 


tainly less. 
aac ck pin of total number of responses. It is thoroughly unsound to 
Suite (5. the total number of responses of a given type in two samples. 
e (58) tested 37 boys and 45 girls. The boys gave a total of 248 F 
No ree all girls combined gave 246. Swift used chi-square, demon- 
simie that these 494 responses were divided in a way which departs 
tici pe from the theoretical ratio 37:45. But this assumes 494 in- 
Mee ent events in her sample whereas she really had 82. The F re- 
hs e S are not independent, since some were made by the same person. 
gro night. properly have used the /-test, applied to the means of the 
ups. The only correct way to use chi-square on her problem is to 
1 F score (cases, not re- 


co 3 É 
mpare the number of cases exceeding a certair 
À similar error has been made by 


ome being the basis of sampling). 
ertzman (26), Rickers (49, p- 231), and Werner (68). 
Richardson (48) followed a different erroneous procedure. In her 
able 9, she determined what proportion of all responses in each of her 
B Olpe were W responses, and tested the difference in proportions for 
Ignificance using the number of subjects in the denominator of the sig- 
ying is actually the 


nifi : 
cance formula. The “proportion” she was stud 


Tatio Mean W/Mean R, and the standard deviation of this is not cor- 
7 Jf she must test the W/R ratio, in 


re x eee 
ctly given by the formula v/ba/ N st ti 
lered later, it is necessary to deter- 


s : 
Dite of the difficulties to be consi¢ 

y and test differences between 

(e.g., chi-square, t-test, etc.). 


m 
ine the ratio for each person separatel 
udies are peculiarly prone to 


€ groups in one of the conventional ways 
an Infation of probabilities. Rorschach stu : priy pipe i 
rati rror which can arise in any statistical work. If a particu ar critica 
; 09 Or chi-square or t-test corresponds to à P of .05, we conventionally 

ignificant because "such a value would 


Int 
€rpret that as statistically S! 


Anas is being tested. 
lons i 
ns, although the estimated popula 
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arise by chance only once in twenty times." While this usually refers to 
once-in-twenty-samples, it may also be thought of as "once in twenty 
significance tests," if the several tests are independent. In some Ror- 
schach studies, a vast number of significance tests are computed. Thus 
Hertz in one study reported the astonishing total of eight hundred sig- 
nificance tests (25). Many of these comparisons reach the one percent 
level or the five percent level, but even these are not all statistically sig- 
nificant. Quite a few of these differences did arise bv chance, and unfor- 
tunately we cannot estimate how many because the tests were not 
experimentally independent. The proper procedure, in such a case, is to 
recognize that an inflation of P values has taken place. The analogy to 
monetary inflation is a fair one: The increase in the number of signifi- 
cance tests in circulation causes each P to have less worth than it would 
normally. We may accordingly raise our Marice arbitrarily, and insist 
that P reach a higher level than .05 before we label it "significant ," and 
a higher level than .01 before we label it “very significant." Of the dif- 
ferences reported in the Rorschach literature as “significant at the 5% 
level," probably the majority are due to chance. 

There are several ways in which significance levels may be inflated 
so that they become falsely encouraging. One is the common procedure 
of testing differences on a great many Rorschach scores. This is of course 
sound practice, but one must then take the total number of significance 
tests into account in evaluating P. The inflation is more subtle when 
the investigator rejects a large number of hypotheses by inspection with- 
out computing significance tests, and reports only a few significance 
tests. Thus Piotrowski and others (46) compared superior and inferior 
mechanical workers on ‘‘all the components used in conventional scor- 
ing as well as many others." They finally invented four composite scor- 
ing signs on which differences between the two samples were large 
enough to encourage a significance test. Suppose, for simplicity, that 
those four tests had yielded P's of .02. The significance of those P's 
must be minimized in view of the fact that four such differences were 
found in several hundred implied comparisons which were not actually 
computed, and two per hundred is chance expectation. 

A comparable inflation arises when an investigator slices a distribu- 
tion in order to take advantage of chance fluctuations and find some 
"hole" where a test will yield a low P. Hertz applied the formula for 
significance of the difference in proportions, to compare two groups on 
M% (Table II). She introduced a spurious clement by slicing the M% 
distribution in so many places, and making so many significance tests: 
If a distribution is dichotomized in many ways, the chances of a “signif- 
icant” difference rise greatly. Here only one test yielded a P of .05, out 
of nine attempted. The interpretation “It may be said with certainty» 
that more girls than boys at 15 years give over 11% M” (25, p- 180) 1s 
unjustified. In another sample this fluctuation would not occur. It 18 
not necessary to test explicitly all possible dichotomies for this type ? 


STATISTICAL METHODS APPLIED TO RORSCHACH SCORES 401 


error to arise. If the investigator examines his distribution and woes 
his cut at the place where the difference is greatest, he has [i mp Uu 
tion examined and discarded all other possible hypothese: ae ee 
several studies where this occurs is that of ume een pmi 

Multiple correlation procedures give rise toa ces cs contend Es 
ten scores are tried as predictors. These scores mig y e investisstor 
prediction formula in an infinite number of ways. When an i 


TABLE II 


ERENCES IN M' 
SIGNIFICANCE DATA REPORTED BY HERTZ FOR P 
15-YEAR-OLD Boys AND GIRLS ( 


% BETWEEN 


AREE = E 


Difference Tested Critical Ratio 
ale 
Difference in means = 05 
Difference in medians : 
Difference in proportions 81 i 
in interval 0-1 m 10 
in interval 0-3 1.83 P: 
in interval 0-5 1.68 hs 
in interval 0-7 90 05 
in interval 0-9 2.34 Pa 
in interval 0-11 1.24 10 
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Rorschach studies which have reported “significant” differences based 
on an empirical formula without confirming them on fresh samples are 
those of Montalto (40), Harris and Christiansen (23), Hertzman, Or- 
lansky, and Seitz (28), and Ross and Ross (52). Thompson (60) reports 
spurious 7'$ but does not claim significance for them. Buhler and Le- 
fever (5, Tables X, XX) mix new cases with the sample used in deriving 
scoring weights, and therefore fail to provide an adequate test of signif- 
icance. Significance tests on fresh samples have been properly made by 
Guilford (21), Gustav (22), Margulies (38), Ross (50), and Kurtz (33). 
The latter gives a particularly clear discussion of the issue involved. In 
most studies, correlations nearly vanish when a Rorschach prediction 
formula is tried on a new sample. 

Still another method of inflating probabilities is to recombine groups 
of subjects in a way to maximize differences. If one has several types 9 
patients, all of whom earn different mean M scores, these groups may 
be recombined in many ways, and in one of the possible regroupings à 
pseudo-significant difference may be found. Rapaport and his coworkers 
(47) have carried inflation to bizarre levels. Not only did they consider 
scores in great profusion and in numerous combinations. They re 
combined their subjects so that the number of implicit significance tests 
in their volume is incalculable. They began with subjects in 22 sub- 
groups. Significance tests were then made, on any score, between any 
pair of subgroups or combinations of them which seemed promising 
after inspection of the data. There were 231 possible pairs of subgroups; 
and an endless variety of combinations. Thus at times Unclassified 
Schizophrenics Acute were lumped with one, two, or more of the follow- 
ing: Paranoid Schiz. Acute, Simple Schiz., Uncl. Schiz. Chronic, Par. 
Schiz. Chr., Uncl. Schiz. Deteriorated; or with all the schizophrenics 
and preschizophrenics; or with Paranoid Condition, Coarctated Pre- 
schiz., Overideational Preschiz., and Obsessive-Compulsive Neurosis: 
Such willingness to test any hypothesis whatever leaves these workers 
open to the charge of having regrouped their cases to augment differ- 
ences. They have undoubtedly reported differences which were created 
by artificial combinations of chance variations between groups. Every 
time cases are recombined for a significance test, one must recognize 
that a large number of implied significance tests were also made, since 
many other recombinations were rejected without actual computation. 

Rorschach studies, because of the great number of scores and the 
large number of subgroups of subjects involved, are more prone to infla- 
tion than other research. The suggestions to be made for sound practice 
are these: 


1. Compare the number of significant differences to the total number of 


ger E in the study, both those computed and those rejected by implica" 
10n. 
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2. Raise the P value required for significance as the number of comparisons 


increases. 

] 3. Never accept 
discriminating power h 
- E liec do not trust signi 

ependent of the fluctuations of 


an empirical composite score or regression formula until its 


as been verified on a new sample. 
ficance tests unless the hypothesis tested was 


a particular sample. 


c investigator have clearly in mind 
d. Comparisons are of three types: 
data are looked at, i.e., before 
ause a cursory inspection 


These suggestions require that th 
the number of comparisons considere 
those rejected as improbable before the 
the study is begun; those not computed bec 
Showed no apparent difference; and those computed. Sometimes the 


investigator begins with, say; five groups of subjects and ten scores, and 
frankly wants to unearth all possible differences between types of sub- 
jects. Then there are ten ways the groups may be paired against each 
other, and since each pair may be compared on each score, there are a 
total of one hundred comparisons in the study. If, on the other hand, 
the investigator sets out to check only certain relationships—‘‘Schizo- 
phrenics differ from neurotics in Ft% " uManics differ from all other 
groups combined in FC: CF C"—those limited hypotheses may be laid 
down in advance of the study, and only those comparisons are counted 
as implied significance tests. To avoid confusion, it is also well for the 

f ble is to be dichoto- 


investigator to specify his cutting point, if a varia ) 
ifferences between groups. This may be set 


mds before examining the d en groups. . ‘be 
y an arbitrary rule, for instance that each distribution Is to be divided 
as near to its median as possible, or by an a priori decision to divide at 

tor must ask himself 


Some point such as 2M. In essence, the investiga s 
efore he gathers his data, “How many comparisons do I intend to look 


at, and charge myself for?” A P of .01 may be called significant if it is 
One of three comparisons charged for, but not if the investigator has 
looked at three hundred comparisons in order to salvage this one 1m- 


Pressive value. 

G GROUPS ON RonscHACH SCORES 
ween Statistical Procedures 

ers which can be added, aver- 
used conventional 


METHODS or COMPARIN 
Necessity for Choosing beti 


r Because Rorschach scores are numb a 
ged, distri investigators have 
, ibuted, etc., most !n 

Mental-test statistics without question. The most common need for 

res of groups and determine the sig- 

i thods encountered in Ror- 

f difference between means 

critical ratio or t-test); analysis of va bi-serial r; eee 
ifference in proportions exceeding à particular score, or chi-square; 


Si r » 
8Nificance of difference betwee? medians. 
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Apart from such errors as those listed in the preceding section, there 
is no reason for considering any of the procedures under discussion as 
mathematically incorrect. If a significant difference is revealed by any 
proper significance test, the null hypothesis must be rejected. Neverthe- 
less, the investigator may not choose one of the techniques at random. 
Different methods of analyzing the data will lead to different conclusions. ln 
particular, some procedures lead to a finding of no significant difference 
even though a true difference could be identified by another attack. 


Let us illustrate first with some of Kaback's data (29). She administered 
the group Rorschach to men in certain occupations, and, inter alia, compare 
her groups on the number of popular responses. The mean for accountants is 
7.0; for accounting students, 7.3. By the t-test, the difference between means is 
not significant (P ca. .40). (Point bi-serial rapplied to the same data gives the 
same significance level. Point bi-serial r and ¢ are interchangeable procedures. 
and there is no merit in testing the hypothesis in both ways.) Butif she had 
chosen the chi-square test, quite proper for her data, Kaback would have found 
a significant difference between the groups. Chi-square would be applied to 
compare the proportion of cases in each group having five or more popular 
responses. From her Table IV, this proportion is 60/75, accountants; 72/15, 
accounting students. The difference between accountants and accounting stu- 
dents is significant (P <.01.) In this and other instances, Kaback disregarded 
a difference when the null hypothesis could be confidently rejected. 

Further illustrative data are taken from Hertz' comparison of Rorschach 
scores of boys and girls. She tested each possible difference by several statistical 
devices, yielding results such as those for M% reproduced in Table II. By any 
of nine methods, she is informed that the two sex groups differ no more than 
might two chance samples. By the other computations, she is informed that the 
difference is significant at the 5% level. If different significance tests disagree, 
what one concludes depends nearly as much on what procedure one adopts as 
on the data themselves. 

Hertz compared her boys and girls in 46 instances. Each time, she tested the 
significance of differences between means and between medians. Four times 
the means differed significantly; five times, the medians differed significantly. 
But in only one out of 46 comparisons was the difference significant by both 
methods. Itis greatly to Hertz’ credit that she saw the applicability of more 
than one significance test. But conclusions of research will be hopelessly con^ 
fused and contradictory, unless we can find a basis for choosing between the pro- 
cedures when one says * "Tis significant" and the other says “Taint.” 


The choice between comparison of means and medians or betwee? 
the t-test and chi-square cannot be left to the inclination of the experi- 
menter; the whole point of statistical method is to make an analysis 
freed from subjective judgment. The reason different methods yiel 
different results is that they make different assumptions or try to dis- 
close different aspects of the data. It is therefore important to recognize 
the ways in which the techniques differ. Differences which are of little 
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* concern in connection with most studies have peculiar importance in 
Rorschach work. The difficulties which make choice of procedures an 
important problem arise from three causes: the skewness of Rorschach 
scores, the complications introduced by ratio scores, and the dependence 


of Rorschach scores on the total number of responses. 


Choice of Techniques in View of the Inequality of Units 
in Rorschach Scales 


chach scores give sharply skewed dis- 
This fact is reported repeatedly (2, 25, 
here many subjects earn 0, 1 or 2 
CF, and C), and in the loca- 
elf is no bar to conventional 
he mean and median are 


: Many of the significant Rors 
tributions for most populations. 
47). Skewness is usually found wl 
points (i.e., M, FM, m, the shading scores, 
tion scores W, D, Dd, and S. Skewness its 


Significance tests. But in skew distributions t 
ve a significant difference in 


not the same. Two distributions may ha 
medians, and not in means (or vice versa) if either is skewed. Further- 
more, it is doubtful if a satisfactory estimate of s.d.man can be obtained 


for a skewed distribution. 
nd related procedures. In any statistical 
s.d., t, analysis of vari- 


of scores (mean, 


Disadvantages of the mean a 
es at different parts of the scale 


computation based on addition 


ance), numerical distances between scor : 
are treated as equal. Thus, since the average of 3 W and 7 W is the same 


as that of 1 W and 9 W, these computations assume that a shift 3 W to 

W is equivalent to, or counterbalances, a shift 7 W to9W. There is no 
Way of demonstrating equality of units unless one has some knowledge 
of the true distribution of the trait in question, or à definition of equality 
in terms of the characteristics of the property being measured. This 


Problem is present in virtually all psychological tools, but other tests 
hich are assumed to represent the true 


yield n H " 1 : 
Spread Poel. a hé other hand, Rorschach inter pretation based on 
clinical experience constantly denies the equality of units for Rorschach 
Scores. The average W score is near 6, and scores from 1 to 10 are usually 
Considered to be within the normal range- No matter how extremely a 
Person is lacking in W tendency, his score cannot go below zero. For 
One who overemphasizes W, the score may g UP i e ade 


Score onlv si E low the mean may be consi : 
y six points be from the mean in 


€ as extreme i irection as a score fifteen points € 
e in that dire (42) has prepared a checklist which shows 


the oth i unroe ri 
er d i ro i 
irection. M would have to be g ouped An 


OW units of certain Rorschach scores 
empting to study differences 


). In att Á 
thod of determining 5.d.mán- 


a n . 
in M. argument is presented by Richardson (48 ecne 
edians, Richardson unfortunately uses an ner 


406 LEE J. CRONBACH 


order to represent à regularly progressing scale of maladjustment. Her 
groupings based on clinical experience are of approximately this nature: 

W (or W%): 0 (or 1 poor) W response; 1-14%; 15-60%; 61-100%- 

Dd%: 0-997; 10-2496; 25-49%; 5010070. 

m: 0-1; 2-3; 4—5; 6 or more. 

If these units represent increasing degrees of maladjustment, the raw 
Rorschach scores do not form a scale of psychologically equal units. It 
is advisable to accept the clinical judgment on this point, especially ia 
the absence of evidence for the assumption of equal units. 

Use of median and chi-square. Unlike procedures involving the addi- 
tion of scores, procedures based on counting of frequencies make no 
assumption about scale units. In fact, they give the same results no 
matter how the scale units are stretched or regrouped. The median, or 
the number of cases falling beyond some critical point (e.g. 10 W), de- 

ends only on the order of scores. This appears to justify the recom- 
mendation that counting procedures such as the median be given prefer- 
ence over additive procedures such as the mean in dealing with skew 
Rorschach distributions. To test the significance of a difference between 
two groups, the best procedure is to make a cut at some suitable score, 
and compare the number of cases in each group falling beyond the cut, 
using chi-square. This procedure is used by Rapaport (47) and Abel (1). 
The test of significance of differences between proportions yields the 
same result (see above). One virtue of cutting scores is that we may 
test for differences between groups both in the “high” and “Jow” di- 
rections. This is important, since either very high F% or very low F9 
for example, may have diagnostic significance. In the usual analysis 
based on means, deviations of the two types cancel. 

In contrast to the chi-square method, many tests of significance in- 
volve computation of the standard deviation. These include the critica 
ratio of a difference between means or medians, analysis of variance, 
and the /-test. In these procedures, great weight is placed on extreme 
deviations from the mean. If mean W is 6, a case having 25 W increases 
zd (which enters the computation of the s.d.) by about 361 points; 4 
case having 15 W increases Ed by about 81 points; and 0 W, by only 
36 points. dn skewed Rorschach distributions, the few cases with many 
responses in a category have a preponderant weight in determining 7 
and the significance of the difference. Whether weighting extreme cases 
heavily is acceptable depends on whether one considers the difference 
between 15 W and 25 W to be psychologically large and deserving 9 
more emphasis than, say, the difference from 0 W to 5W. Chi-square 
weights equally all scores below (or above) the cutting point. 

„Normalizing distributions. One method used to obtain more equal 
units is to assume that the trait underlying the score is distribute 
normally in the population. Raw scores are converted to T-scores which 


m 


STATISTICAL METHODS APPLIED TO RORSCHACH SCORES] 407 


D distributed (35, 67). (This procedure must be distinguished 
r conversion, also called a T-score, used by Schmidt (54). 
Scores of the type Schmidt used are not normally distributed.) The ef- 
fect of normalizing is to stretch the scale of scores as if it were made of 
rubber. Extreme scores below the median are weighted symmetrically 
to extreme scores above the median. Thus, in the conversion table pre- 
pared by Rieger and used by the writer (10), the median (63 W) is placed 
at 100, and a score of 0 W is converted to 66, while 28 W becomes 134. 
This in effect compresses the high end of the W scale and expands the 
low end. This conversion does not alter any conclusion or significance 
test obtained by dichotomizing raw scores and applying chi-square. 
But the conversion alters markedly any conclusion based on variance or 


on comparison of means. 


There is obviously much merit in using a procedure which leads to a 


single invariant result, independent of the assumption of the investi- 
gator about the equivalence of scores. Even if scores are normalized it is 
d to indicate central tendency, and chi- 


advised that the median be use j r 
Square to test significance. If, for some experimental design, the data 
believes normalized 


must be treated by analysis of variance, the writer 
Scores will give results nearer to psychological reality than raw scores, 


but thus judgment is entirely subjective. 

. Comparison of mean rank. Attention should be drawn to a new tech- 
nique invented by Festinger (14) which is peculiarly suitable to the 
Problem under discussion. This method assumes nothing about equality 
of units or normality of distributions, being based solely on the rank- 
order of individuals. To test whether two groups differ significantly in a 


Score, one pools the two samples and determines the rank of each man 
in the combined group. The mean rank for each group 1$ computed and 
the significance of the difference is evaluated by Festinger's tables. The 
ed in Rorschach research. 


method has not yet been employ I not interchangeable 
T. $ d chi-square are e. 
he Festinger method an ic of a particular study. Chi- 


Whi he logi 
hich should be used depende pic ey GrOUP A contain more deviates 


Square answers such a question as : i 
than Group B in the ic being studied?” fp amd aed e 
Weight to diffe I al the scale, and there ore asks W 
rences all along z 
two groups differ, all scores being considered. E. omens i inim 
4 is quite i 1 i in the middle oi the range 
quite important but differences : 
Practica] ponerse In another study, differences all along the scale 
a 2 
Te worth equal attention. 4 — 
The Festinger method appears to have the advantage ‘ g ^ 
Stability for small samples. Chi-square 1S much easier to use oe s 
of 30 or more per group. The Festinger method is not useful when we 
are Aere chee in score. Further experience with the new me 


May disclose other important distinctions. 
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Significance Tests Compared with Estimates of Relationship 


Some investigators have perhaps not conveyed the full meaning of 
their findings to the reader because of a failure to distinguish between 
tests of the null hypothesis, and estimates of the probable degree of re- 
lationship between two variables. The former type of result is a function 
of the number of cases, whereas the latter is not, save that it becomes 
more trustworthy as more cases are included. When an investigator 
applies chi-square, the /-test, or the like, he determines whether his 
observations force him to conclude that there is a relationship between 
the variables compared. But if the degree of relationship is moderately 
low, and the number of cases small, the null hypothesis is customarily 
accepted even though a true relationship exists. It is proper scientific 
procedure to be cautious, to reject the hypothesis of relationship when 
the null hypothesis is adequate to account for the data. But in Ror- 
schach studies, where sample size has often been extremely restricted, 
nonsignificant findings may have been reported in a way which dis- 
couraged investigators from pursuing the matter with more cases. 


The study of McCandless (36) is a case in point. McCandless compared 
Rorschach scores with achievement in officer candidate school. In each instance 
save one, the t-test showed P greater than .05 that the difference would arise in 
chance sampling. But the samples compared contained only thirteen men per 
group. Under these circumstances, it would take a sharply discriminating score 
to yield a significant difference. If the sample size was raised to about 50 per 
group, and the differences between groups remained the same, twelve more of 
McCandless’ thirty significance tests would be significant at the five percent, 
or even the one percent, level. When more cases are added, the differences will 
certainly change and most of them will be reduced in size. In fact, the writer 
believes, on the basis of other experience with statistical comparisons of the 
Rorschach with grades, that McCandless’ negative findings are probably close 
to the results which would be found with a larger sample. But the point is that 
McCandless, and other investigators using small N's, have submitted the 
Rorschach to an extremely, perhaps unfairly, rigorous test. One way to com- 
pensate for the necessary rigor of proper significance tests is to also report the 
degree of relationship. A chi-square test may be supplemented by a contingency 
coefficient or a tetrachoric r. A t-test may be supplemented by a bi-serial 7, OF 
point bi-serial (not to determine significance, as Kaback used it, but to express 
the magnitude of the relationship). Sometimes reporting the means of the 
groups and their standard deviations, to indicate the degree of overlapping, is 
an adequate way to demonstrate whether the relationship looks promising 
enough to warrant further investigation. 


. To restate the problem: the investigator always implies two thing® 
in a comparison of groups: (a) that he considers the null hypothesis i 
definitely disproven by his data, or else that the null hypothesis is one 
way to account for the data, and (b) in case the null hypothesis stil 
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remains tenable, that he does or does not judge further investigation of 
the question to be warranted. He can never prove that there is no es 
lationship. So, if his data report a non-significant difference, he must 
Judge whether the difference is "promising" enough to warrant further 
Studies. This judgment is not reducible to rules in the way the signifi- 
cance test is. Whether to recommend further work depends on the 
difficulty of the study, on the probable usefulness of the results if a low 
order of relationship were definitely established by further work, and in 
the Investigator’s general confidence that the postulated relationship is 
likely to be found. 
Methods of Partialling Out Differences in R 


. The usual approach when comparing groups is to test the differences 
in one score after another, and then to generalize that the groups differ 
in the traits to which the scores allegedly correspond. The various 
Scores, however, are not experimentally independent—a man’s total 
record is obtained at once, and his productivity influences all his scores. 

f two groups differ in R, they may also differ in the same direction in W 
(whole responses), D (usual details), and Dd (unusual details), 


Thus consider the Air Force data in Table III. 


TABLE III 
RORSCHACH SCORES COMPARED TO Success IN PrLor TRAINING (21, p. 632) 


Mean of Mean of 
Rorschach Successful Unsuccessful Bi-serial r 

Score Cadets Cadels 

R 18.5 15.8 14 

Ww 9.2 7.3 .24 

D 74 6.7 .03 

w% 60.2 op 4 4 

D% 31.7 37. ‘ 

ns in W and 


The first group has more responses than the second. From the mea 
' It would appear that the first group has more W tendency than the second, 
ut is equal in D But when responsiveness is controlled by converting scores 


i percentages, the difference in W becomes small and the second group is shown 
€ stronger than the first in emphasis on D. 
The most striking illustration of this difficulty is Goldfarb's com- 
Parison of obsessionals and normals. The obsessional group averages 
3 R; the normals, 14. Under the circumstances, it is not at all in- 
9rmatiye to fiiic to test W, D, and Dd; all differ significantly in the 
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same direction. One learns nothing about differences between groups in 
mental approach, which is the purpose of considering these three scores. 
Most of Goldfarb's other comparisons also merely duplicate the in- 
formation given by the test in R, that is, that the obsessionals are more 
productive. Although the discrepancy between the groups in R is un- 
usually striking in Goldfarb's group, it is present to a lesser but signifi- 
cant degree in a great number of other studies, including those of 
Buhler and Lefever (5), Hertzman (26), Naback (29), Margulies (38), 
and Schmidt (54). 

A similar problem complicate! cc's comparison of schizophrenics 
and normals on D. The means were 19.0 and 19.9, respectively; the 
o’s were 13.5 and 9.9. Beck comments as follows: 

The small difference is accentuated in the very small Diff./S.D. diff: 0.34. 
There is, however, probably a spurious factor in this small difference. The 
ogives give us a hint: up to the eighty-second percentile, the curves run parallel, 
with that for controls where we should expect it, higher. Above this point, 
the schizophrenics’ curve crosses over, and continues higher, and more scatter- 
ing, as we should expect from the S. D. The spurious clement lies undoubtedly 
in the fact that the schizophrenics' higher response total would necessarily in- 
crease the absolute quantity of D, since these form the largest proportion of 
responses in practically all records. Absolute quantity of details is then no indi- 


cator of the kind of personality we are dealing with. . . . The medians for D are 
14.46, 17.2 (2, pp. 31-32). 


When one makes several significance tests in which the difference in 
R reappears in various guises, one becomes involved in a maze of seem- 
ingly contradictory findings. And interpretation tempts one to violate 
the rule of parsimony, that an observed difference shall be interpreted by 
the fewest and simplest adequate hypotheses. To answer the question 
how do obsessionals and normals differ? it is simpler to speak of the 
former as more productive than to discuss three hypotheses, one for 
each approach factor. And one may certainly criticize Hertzman and 
Margulies (27) for interpreting differences in D and Dd between older 
and younger children as showing the former's greater "cognizance of 
the ordinary aspects of reality" and greater concern with facts. The 
older group gives twice as many R's as the former, which is sufficient tO 
account for the remaining differences. 

One might argue that R is resultant rather than cause, and that the 
differences in W, D, Dd, etc. are basic. But the Air Force demonstratio? 
that R varies significantly from examiner to examiner (21) suggests 
strongly that responsiveness is a partly superficial factor which should 
be controlled. 


Only two studies examine their data explicitly to determine if differences in 
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other categories could i i 

(68) f les cou be explained in terms of responsiven 

ia É C - difference in di% between is ae aa a, Werner 

sabe tae e : he latter gave significantly more R's. He therefore en 

ie cee ree responses in each card, and arrived at new totals Witt i 
id about constant, he found the dd difference still marked and vue 


validly i T 
D eden terpret his result as showing a difference in approach 
also dT € wi | vd a that groups who differed in glucose tolerance 
nificantly in R. After testing diff i 
tiM sere : g differences in M and 
ota! sample, they discarded cases until tl ri —— A. 
eiue: ) a 2 he two subsamples were e i 
wke eas between the groups in M and C were in the same I nere rere 
was held constant, they were able to conclude with greater vom mein 
ce 


that glucose tolerance is related to M and C. 
ese phone ie in R are tested for significance, it is appropriate to 
pens E 1er hypotheses are required to account for differences in the 
in A ut there other hypotheses should be independent of R; other- 
e merely repeats the former significance test and obscures the 
pars " gg control method is to divide scores by R, testing differ- 
MEDIA dm D^, M%, AN, P% etc. Such ratios present serious 
ifficulties discussed in the next section. Moreover, these 


formulas fail to satisfy the demand for independence from R. There 
(For a sample of 268 


mav . ^ 
Pu be correlation between R and W %, etc. 
uperior adults from a study by Audrey Rieger, the writer calculates 


an r's: WO, x R, —.45, M%XR, 03, F%XR, .06. In the latter two 
ERA iac is no functional relation of the percentage with R, but the 
"Ws i sia are heteroskedastic. ows, =3.30 when R 5-19 (74 cases) but 
ene when R 40-109 (82 cases). The corresponding sigmas for M% are 
s pa 3.35; for F%, 3.23 and 2.29. Only M% is really independent 

One may control differences in R by other methods, provided many 
Cases are available. One procedure is to divide the samples into sub- 
Soups within which R is nearly uniform (e.g., R 20-29), and make 


Significance tests for each such set. A method which requires somewhat 
the total sample or a 


f » i 
i Cases is to plot the variable against R for 
andard sample, and draw a line fitting the medians of the columns 


m his may be done freehand with no serious error. Then the proportion 
the cases in each group falling above the line of medians may be 


co. 
pared by chi-square. 


Difficulties in T! veating Ratios and Differences 
idespread use, the Rorschach test 


thmetic combinations of directly 
core, or the percentage in which 
les are W: M, M:sum C, W/R 


~ phe than any previous test in w 
cou ™ployed "scores" which are ari 
nted scores. One type is the ratio s 

* divisor is a variable score. Examp 
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(W%), and F/R (F%). The other type of composite is the difference 
score, such as FC — (CF--C). In clinical practice, scores of this type are 
used to draw attention to significant combinations of the original scores; 
the experienced interpreter thinks of several scores such as FC, CF, and 
C, at once, placing little weight on the computed ratio or difference. 
When these scores are used statistically, however, there is no room for 
the flexible operation of intelligence; the ratios are treated as precise 


quantities. 


It may be noted in passing that a few workers (e.g., 63) appear to assume 
a 
b 
of the ratios and the ratio of the means may be quite unequal. One cannot, as 
Kaback did (29, pp. 33, 53, 55), assume that if the ratio of the means is greater 
for one group than another, the groups differ in the ratio scores themselves. The 
reader may convince himself by computing the mean ratio for each of the fol- 
lowing sets of data in which Mean a/Mean b is constant: 


024683 0 2 4 6 8 O 2 4 6 8 


that Mean a/Mean b is the same as Mean ~. This is of course not true; the mean 


9’ 4’ 5" 810’ 6! @ 2° 40° 4" 10’ aret d 2 


One difficulty with ratio scores is their unreliability. Consider a 
case with 5 W, 1 M. The ratio W:M is 5. But M is a fallible score. On 
a parallel test it might shift to 0 or to 2. If so, the ratio could drop to 
23, or zoom to infinity; such a score is too unstable to deserve precise 
treatment. The unreliability of another ratio is illustrated in Thornton 
and Guilford's data (61). The reliabilities were, in one sample, .92 for 
M, .94 for C, but .81 for M/C. In a second sample, the values were .77, 
.65, and .31. If unreliable ratios are added, squared, and so on, one 
commits no logical error, but psychologically significant differences 
become overshadowed by errors of measurement. 

f Ratios based on small denominators are in general unreliable (7). 
W% is unreliable for a subject whose R is 12, but relatively reliable for a 
case whose R is 30. In the former case, addition of one W response 
raises W% by 8; in the latter, by 3%. Errors of measurement always 
reduce the significance of differences by increasing the within-groups 
variance. A significant difference in W% might be found for cases where 
R>25. A difference of the same size might not be significant for cases 
where R <25 because of the unreliability of the ratio. If the significance 
test were based on all cases combined, the difference might be obscure 
by the unreliability of the ratios in the latter group. One possible pro* 
cedure is to drop from the computations all cases where the denominator 
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Is low. (If there is a significant difference even including the unreliable 
Scores, this need not be done.) 
The issue of skewness must again be raised. In the M:sum C ratio, 
all cases with excess C fall between zero and 1. Those with excess M 
range from 1 to ©. The latter cases swing the mean and sigma. Follow- 
ng the argument of a preceding section, it is injudicious to employ 
Statistics based on the mean and standard deviation, as McCandless 
(36) did. By such procedures, different conclusions would often be 
reached if both M: sum C and sum C: M were tested. Procedures lead- 
ing to a chi-square test are to be recommended, as illustrated in several 
studies (Rapaport, 47, pp. 251; Rickers, 49; etc.) Another solution, less 
generally suitable, is to convert ratio scores to logarithmic form to ob- 
tain a symmetrical distribution (61). 
A hidden assumption in ratios and differences is that patterns of 
Scores yielding equal ratios (or differences) are psychologically equal. 
hus, in W% the same ratio is yielded by 2 W out of 10 R, 8 W out of 
40, and 20 W in 100 R. One can always define and manipulate any 
arbitrary pattern of scores without justifying it psychologically, but 
better conclusions are reached if the assumption of equivalence is 
defensible. The regression of W on R is definitely curved. A person with 
2 W out of 10 R is low in W tendency, since it is very easy to find two 
Wholes in the cards. Only people with strong tendency and ability to 


Perceive wholes can find 20 W in the ten cards, regardless of R. As R 
the additional responses come 


rises above 40, W seems to rise very little; ition: 
ing decline in W% reflects a 


Principally from D and Dd. The resulti 
tive to quantity, rather than a decreased interest in W (cf. 47, p. 156). 


Put another way: a strong drive to W can easily lead to 90 or 100% 
W when R «15: but such a ratio in a very productive person is unheard 
of. If the rarei of a on b is linear and a close approximation to 


A b) — some constant, ratios may be € asa ee with little hesitancy. 

therwise the ratio is a function of the denominator. : 

This factor is recognized by Munroe, who indicates repeatedly in her 
ratio depends on R. Thus 


Checklist that the significance of a particular : 
0-40% WM is rated p if R10, but 16-29% is rated + if R=50. Nu- 


Merically equal Rorschach ratios, then, are not psychologically equal. 
-“@Paport reflects the same point in testing differences between ame ow 
m W/D. Instead of applying chi-square to the proportions having the 
Tatio 1:2 or lower, he adjusted his standard. 

., , In record Ris too low or too high, we took cogniz 
It is eme A few IV's and difficult to get too many. 


ance of the fact that 
Thus, in low 
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(W%), and F/R (F%). The other type of composite is the difference 
score, such as FC—(CF+C). In clinical practice, scores of this type are 
used to draw attention to significant combinations of the original scores; 
the experienced interpreter thinks of several scores such as FC, CF, and 
C, at once, placing little weight on the computed ratio or difference. 
When these scores are used statistically, however, there is no roon for 
the flexible operation of intelligence; the ratios are treated as precise 
quantities. 


It may be noted in passing that a few workers (e.g., 63) appear to assume 
è a 
that Mean a/Mean b is the same as Mean b This is of course not true; the mean 
of the ratios and the ratio of the means ma i tas 
i y be quite unequal. One cannot, 
Kaback did (29, pp. 33, 53, 55), assume that if the ratio of the means is greater 
for one group than another, the groups differ in the ratio scores themselves. The 


reader may convince himself by computing the mean ratio for each of the fol- 
lowing sets of data in which Mean a/Mean b is constant: 


One difficulty with ratio scores is thei iabili ider 2 
case with 5 W,1 M. The ratio W: M is eee eee On 
a parallel test it might shift to 0 or to 2. If so, the ratio could drop t° 

2, or zoom to infinity; such a score is too unstable to deserve precis? 
treatment. The unreliability of another ratio is illustrated in Thornto? 
and Guilford's data (61). The reliabilities were, in one sample, -92 Í9 
M, .94 for C, but .81 for M/C. In a second sample, the values were Th 
.65, and .31. If unreliable ratios are added, squared and so on, gne 
commits no logical error, but psychologically digiilisant difference? 
become overshadowed by errors of measurement 

f Ratios based on small denominat f 
W% is unreliable for a subject whose R is 12, but relatively reliable for i 
case whose R is 30. In the former case, addon of one W response 
raises W% by 8; in the latter, by 3%. Errors of measurement always 
reduce the significance of differences by increasing tie within-grouP* 
variance. A significant difference in W% might be found for cases whe 
R>25. A difference of the same size might not be significant for cas 
where R «25 because of the unreliability of the ratio. If the significans 
test were based on all cases combined, the difference might be obscure 
by the unreliability of the ratios in the latter group. One possible p 
cedure is to drop from the computations all cases where the denominat 


ors are in general unreliable (n. 


STATISTICAL METHODS APPLIED TO RORSCHACH SCORES 413 


islow. (If aud -— x i 
oe ee I 
" zt Mene of skewness must again be raised. In the M:sum C ratio, 
riba with excess C fall between zero and 1. Those with excess M 
mm aa 1to c. The latter cases swing the mean and sigma. Follow- 
e fea M imr of a preceding section, 1t 1s injudicious to employ 
(36) "Hel es on the mean and standard deviation, as McCandless 
Hs : i such procedures, different conclusions would often be 
en both M: sum C and sum C: M were tested. Procedures lead- 
s 0 a chi-square test are to be recommended, as illustrated in several 
studies (Rapaport, 47, pp. 251; Rickers, 49; etc.) Another solution, less 
— suitable, is to convert ratio scores to logarithmic form to ob- 
ain a symmetrical distribution (61). 
"m. hidden assumption in ratios and differences is that patterns of 
TAn J ielding equal ratios (or differences) are psychologically equal. 

is, in W% the same ratio is yielded by 2 W out of 10 R, 8 W out of 
40, and 20 W in 100 R. One can always define and manipulate any 
arbitrary pattern of scores without justifying it psychologically, but 
better conclusions are reached if the assumption of equivalence is 
defensible. The regression of W on R is definitely curved. A person with 
2 W out of 10 R is low in W tendency, since it is very easy to find two 
wholes in the cards. Only people with strong tendency and ability to 
Perceive wholes can find 20 W in the ten cards, regardless of R. As R 


rises above 40, W seems to rise very little; the additional responses come 
ng decline in W% reflects a 


Principally from D and Dd. The resulti 
= to quantity, rather than a decreased interest in W (cf. 47, p. 156). 
ut another way: a strong drive to W can easily lead to 90 or 100% 
W when R «15; but such a ratio in a very productive person is unheard 
of. If the regression of a on b is linear and a close approximation to 
(a/b) = some constant, ratios may be used as a score with little hesitancy. 
therwise the ratio is a function of the denominator. 


This factor is recognized by Munroe, who indicates repeatedly in her 
of a particular ratio depends on R. Thus 


Checklist that the significance 
30-40% M is rated = if R=10, but 16-29% is rated + if R=50. Nu- 
Merically equal Rorschach ratios, then, are not psychologically equal. 
;"apaport reflects the same point in testing differences between groups 
: W/D. Instead of applying chi-square to the proportions having the 
Tatio 1:2 or lower, he adjusted his standard. 


In record. s hi 
too low or too hig 
s where R is too ded difficult to 


S difficult not to get a few 


i h, we took cognizance of the fact that 
uis get too many. Thus, in low 


414 LEE J. CRONBACH 


R records the 1:2 norm shifted to a “nearly 1:1" while in high R records, the 1:2 
norm shifted to a 1:3 ratio (47, p. 134). 


This adjustment was evidently done on a somewhat subjective — 
and is therefore not the best procedure. It is unfortunate that wo 
other workers have unquestionably assumed that a given score in 0: 
M%, or FC — (CF-- C) has the same meaning regardless of R. " 
At best, ratio- and difference-scores introduce difficulties due to ut 
reliability and to assumptions of equivalence. There isa fairly adequate 
alternative which avoids statistical manipulation of ratios entirely. on 
need only list all significant patterns, and determine the frequency o 
cases having a given pattern. Thus M: sum C can be treated in these 
categories: coartated (M and C 2 or below); ambiequal, M or C «2, M 
and C differ by 2 or less; introversive, M exceeds C by 3 or more; extra- 
tensive, C exceeds M by 3 or more. Any other psychologically reason- 
able division of cases may be made, and significance of differences tested 
by chi-square, provided that the hypothesis is not chosen to take 
advantage of fluctuations in a particular sample. Even this method, 
however, does not escape the criticism that a given pattern of two 
scores, such as 3 M, 3 C, has different significance in records where R 


differs greatly. To cope with this limitation, the pattern tabulation 
procedure is suggested later. 


A detailed consideration of certain work by Margulies is now appropriate, 
since it affords an illustration of many problems presented above. Her study of 
the W: M ratio employed a procedure almost like that just recommended, but 
with departures which are unsound. Margulies compared Rorschach records of 
adolescents having good and poor school records (38). Only her 21 successful 
boys and her 32 unsuccessful boys need be considered here. She was interested 
in comparing them on the W: M pattern, in view of Klopfer's belief that this 
ratio indicates efficient or inefficient use of capacity. She not only tested her data 
in several ways. but reported the data so that other calculations can be made. 
"Table IV reproduces a part of her data, and shows the results of seven different 
procedures for determining the significance of the difference. 

It should be noted first that Yates' correction is essential for tables with 1 
d.f. and low frequencies; in each case where it is applicable, the correction lowers 
the significance value importantly. Second, attention may be turned to the use 
of chi-square to test differences between two distributions. Even if more cases 
were available, it would be unwise to apply chi-square to the distribution cell- 
by-cell (Procedures 2, 3), since this procedure ignores the regular trend from 

class-interval to class-interval. Instead, the distribution should be dichoto- 
mized. Therefore, procedure 5 is preferable to2, and6is preferable to 3. It will E 
noted that these recommended procedures indicate higher significance than the 
tests in which the distributions are compared cell-by-cell. -—— 
Margulies is one of the few writers to note the unsoundness of assuming t ily 
equal ratios are equal. She pointed out that 20 W: 10 M is not psychologica 
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TABLE IV 


RESULTS OBTAINED WHEN A SET OF DATA IS TREATED BY A VARIETY OF PROCEDURES 
(Data from Margulies, 38, pp. 23, 26, 44) 


Distribution I Distribution II Distribution III 


Suc-  Unsuc- W/M Suc- Unsuc- Pattern Suc- Unsuc- 


Number 
of M cessful cessful ralio cessful cessful of W cessful cessful 
boys boys boys boys and M — boys boys 
Sio 5 5 «1 1 1 W «6, 
Ee Moi 0 10 
2 9 8 — 1.00 0 2 TW «6, 
M»1 8 2 
1 3 1o 1.1-2.9 8 5 W>s, 
M 0-1 7 9 
E 4 g 3.049 5 7 W 6-10, 
M2 3 1 
24.9 3 9 W 6-10, 
M>2 1 4 
« (W/0) 4 8 W»10, 
M>1 2 0 


similar to2 W: 1 M, and she demonstrated that the regression of M on W is sig- 
nificantly curvilinear. She therefore was properly critical of procedures such as 
3 and 6. She next turned to the scatter diagram of M and W, and found suc- 


cessful boys predominating in some regions, and unsuccessful boys in others. 


fter grouping scores into regions as shown in Distribution III, she divided the 
cluding cases where W is 0 to 5 and M is 2 or 


Surface into two areas, one area in ri 
10 and M is 3 orover, plus cases where W is over 

10 and M is 2 or over. In other words, instead of testing whether the groups are 
ht line M 72 (Procedure 5), she made her 


differen i 
tiated by a cut c ; 
cutting line an irregular one. This hypothesis, tested in Procedure 7, gay e ap- 
Parently quite significant results. The results are of little value, Bue m^ 
the hypothesis was “cooked up" to fit the irregularities of these specific data. 
In the cells where IV is 6 to 10, and M is 2, there a son 
Of unsuccessful boys. But to draw the cutting line pae 
areas where tl i ful predominate is a type ° 

iti he unsuccess ul pre O: £ à : 
Vitiates a ve bae test. Hundreds of such irregular lines might ne apn 

herefore, it would be expected that in any sample some inaron : depen 
Yielding a difference “significant” at the 1% level. At be e S eue 
Sets up a hypothesis which, if found to yield a singificant di erence in 
independent sample, could be taken as possibly true- 

problem. Wt 


The law of parsimony enters this l 
yen ss 

ay be wir equally well by two hypotheses, m we pai 

t cie i lar cutting lines, à - 

9 accept the simpler hypothesis Irregular cu' panel aud aea 


i i es j 
Hons in terms of patterns of scores, are sometimes J 


nerever a set of data 
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TABLE IV— (continued) 
Results with 
Yates’ 
€ Procedure Result P sorretti 
Y x P 
Central 1. Significance of differ- 
tendency ence in mean M CR= .70* .48 
Cell-by-cell 2. Chi-squareapplied to x? = 3.78*** ca. .30 
comparison Distribution I (3 d.f.) 
3. Chi-square applied to 
Distribution II (5 
d.f.) x = 5.30* ca. .40 
4. Chi-square applied to 
Distribution III (5 
d.f.) xX =17.73* <.01 
Dichotomy 5. Chi-square applied to 
number of cases with 
M>1 (Dist. I) x! = 3.46** .06 2.54** 11 
6. Chi-square applied to 
number of cases with 
W/M»3(Dist.]]) x? = 1.86** 448 1.13** .30 
Frequency of se- 7. Chi-square applied to 
lected patterns frequency having 
M>1 if W>6 or 
>10; having M>2 if 
6<W<10 (Dist. III) x? = 6.58** .01 5.13** .03 


* Computed by Margulies. 
** Computed by the writer. 
*** Computed by the writer. Margulies reports 3.64. 


sary. But in this case the difference between the groups is explained as 
well by the hypothesis that the successful boys give more M's as bY 


any non-spurious test of the W: M relationship. Therefore, procedure 5 
is the soundest expression of the significance of the Margulies data- 
With more cases, this difference might be found to be truly significant- 

In the above analysis, we find again that different procedures, more 
than one of which is mathematically sound, give different conclusions. 
The results from chi-square are less compatible with the null hypothesis 
than is the critical ratio, Chi-square applied to a dichotomy gives ev! 


t 
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dence of a possible relationship whereas chi-square applied to the fre- 
quency distribution does not. Attention is again drawn to the necessity 
t suspicion any significance test based on a com- 
advantage of the fluctuations of fre- 
Finally, it is noted that explanations in 
ld not be sought unless they can ac- 
than can hypotheses in 


of regarding with grea 
plex hypothesis set up to take 
quencies in a particular sample. 
terms of ratios and patterns shou 
count for observed differences more completely 
terms of single scores. 
TREATING PATTERNS OF SCORES 

Rorschach workers continually stress the importance of considering 

any score in relation to the unique pattern of scores for the individual. 


While this is done in clinical practice, there is no practical statistical 
lex interrelations of scores and 


procedure for studying the infinite comp 
indications on which the clinician relies. Instead of considering the in- 
dividual patterns, the statistician can at best study certain specific 
patterns likely to occur in many records. A pattern can be exceedingly 
prevent one from studying 


complex; there is no statistical reason to f 
whether (for example) more men than women show high-S-on-colored- 


cards-accompanied-by-emphasis-on-M-and-excess.of- G over The 
only limitation the statistical approach imposes 1s that the same pattern 


of scores must be studied in all cases. E 
Patterns of scores may be considered by means of composite scores, 

by definition of significant "signs," and by the pattern-tabulation 
method. The composite score is simply an attempt to express, in a 
formula, some psychologically important relationship. Examples in- 
mposites developed 


clude the AM: sum C ratio, and the more complex compost! d 
by Hertz or Rapaport. These scores may be treated statistically like any 


Score on a single category, although most of them are ratios or differ- 


ences and suffer from the limitations already discussed. 
' approach has been 


ttai , 
Comparing inci tstons.” The signs 
; ng incidence of '5!£ i 
widely used i is simple and well-adapted to the Rorschach test. Nor 
acteristic of a special group, 


tifies some char f group 
| observation. Then this characteristic 1s 


Such " : ini 
ih neuron from Ri rang ore ving e Chara 
Istic. One 2 ‘sign, for example, is FM> M. After the. sain e: 
Ypothesizes that Sore sign is discriminative, the pees d oir 2 
Making a test of significance to se if the sign 1S found more often in the 
e a new sample of 


mally, an investigator iden 


‘ype of : soundly compar! 
person in question. One may 2 

* diagnosed inq ‘ith a control sample by noting the frequency of 

i group wi i-square. This procedure is illus- 


ne . H h 
od ee ésch group and applying rargulies (38), and Ross (50). 


tr A 
ated in studies by Hertzman an 
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The investigator may invent his own signs, if he follows die pie 
cautions to avoid misleading inflation of probabilities. Often it is ez ror 
and equally wise to use a predetermined set of signs. The sese a 
set of signs available at present is the Munroe checklist. She has ic ud 
fied numerous ratios and patterns of scores which she considers signi : 
cant of disturbance in her subjects (adolescent girls). She has —-— 
that she does not think of her method as a set of signs (41), but the dif- 
ference between her list and others appears to be (a) that it ca crie 
inclusive survey of all deviations in a record and (b) that the list is de- 
signed as a whole to minimize duplication from sign to sign. There is ng 
reason why two groups may not be compared by applying the checklist 
to every record, and then comparing the groups on the frequency with 
which they receive each of the possible checks. Chi-square is the proper 
significance test, as used in one of Munroe's studies (43). The Munroe 
signs sometimes are simply defined (e.g. P— is 0 or 1 populai response), 
but some involve patterns of several scores (thus the sign FM + is de- 
fined in terms of FM, M, and R). 

Paltern tabulation. Pattern tabulation is a method devised by Cron- 
bach for the study of relations between two or three scores (10). It has 
the advantage of permitting one to study the distribution of patterns in 
a group. To deal with any set of three scores, e.g. W, D, Dd, one normal- 
izes the three scores for each person, and considers the resulting profile. 
The profile is expressed numerically in terms of the deviation of the 
converted scores from their average for each person. These three scores 
can be plotted on a plane surface, and the resulting scattergram shows 
the distribution of patterns in a group. If two groups are compared, any 
type of pattern found more commonly in one group than another can be 
identified, and the difference in frequency tested by chi-square. The 
significance level for rejecting the null hypothesis must be set conserva- 
tively, as this method involves many implied significance tests. An 
analysis of variance solution is also possible but not recommended in 


view of the fact that distributions of patterns are often non-normal. 
This method cannot consider hy 


scores at once. It functions best wh 
able and equally intercorrelated. 
fact that some Rorschach scores ar 


potheses involving more than three 
en the three scores are equally reli- 
It encounters difficulty due to the 


e unreliable, since any serious error 
of measurement in one score throws an error into the profile, The 


method does, however, appear flexible and especially useful for such 
meaningful patterns as W-D-Dd and M-sum C-F. 


Another group of procedures leading to composite formulas for dis- 
criminating groups is treated in the next section. 


DISCRIMINATION BY COMPOSITE SCORES 


In many problems, it is desired to use the Rorschach to discriminate 
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between tw 
two groups. Thus i c i 
M cec g ps. ` one might seek a scoring formula to predict 
ccess, or a "neurotic index" to screen neurotics from a general 
Peut The methods used to arrive at composite scores are the 
ecklist, the multiple regression equation, and the discriminant func- 


tion. 

" Me. scores. The checklist consists of a set of signs. Each person 
wes c iun the checklist and the total number of signs or checks is 
mu Uia score. This method has had considerable success, 
: ably in Munroe's study (42) and in the formula of Harrower-Erick- 
on and Miale for identifying insecure persons. There are no serious 
statistical problems in the use of checklists. The total score can be cor- 
related (though eta may be preferable to r). Differences between groups 
may be tested for significance, preferably by chi-square. Chi-square is 
advised because a difference in the non-deviate range is rarely psycho- 


logically significant; the investigator is usually concerned with the pro- 
Buhler and Lefever justifi- 


Portion of any group in the deviate range. 
ably apply analysis of variance to their checklist score, to study its abil- 
ity to differentiate clinical groups (5). 

loping checklist scores. A com- 


Problems do arise, however, in deve 
roups on one raw score after another, 


mon method is to compare two g 

noting where their means differ. Each score where a difference arises is 

then listed as a sign, and counted positively or negatively in obtaining 

the checklist score for each case. This method takes advantage of what- 

ever differences between samples arise just from accidents of sampling. 

If sample A exceeds B in mean M, allowing one point in the total score 
his sample, the A's 


for high Af will help discriminate A's and B's. Int 
Will tend to earn higher checklist scores. But often in a new sample such 


à difference will not be confirmed, and the M entry in the composite 


Will not discriminate. 


One study employing the s 
Workers. Davidson (12) sough 
background and Rorschach performan 
Pe Her treatment of data is notewor 

ures; statistics are applied with grea 


adopted for each new type of comparison. A 
Some of the judgments she made in selecting procedures, her treatment is free 


rom overt errors and well worth study by other Rorschach investigators. 
Davidson divided her 102 cases among seven economic levels. She studied 

the Rorschach performance in various ways. First, she made a clinical analysis 
9! each child, and placed him in one of nine categories (introvert adjusted, child- 
'sh, constricted dienirbed etc.). The distribution which resulted is a 7x9 
‘able, Recognizin that the expected frequency in each cell is quite small, she 
Combined groups = form a 3X3 table before applying the chi-square test for 
Significance. This same type of condensation would have been advisable - d 
*r comparisons she made, suc 3 


h should be pointed out to Rorschach 
ne the relationship between economic 
f highly intelligent chil- 
thy because of the flexibility of her pro- 
t intelligence, new procedures being 
While the reviewer disagrees with 


ign approac! 


t to determi 
ce in a group o 


h as that between personality pattern an 


420 LEE J. CRONBACH 


Davidson next applied a list of signs, and obtained for each case the Se. 
ber of signs of maladjustment. The number of signs was correlated wit eco 
nomic level, and the correlation was shown not to differ significantly from zero. 
She tested the significance of the difference in mean number of signs by the 
critical ratio. These procedures appear well suited to her data. A third attack 
on the data treats one Rorschach score at a time. Here Davidson placed her 
cases in seven categories, ranging from highest to lowest economic level. By 
analysis of variance, she demonstrated that differences among the seven groups 
were significant only for a few of the scores. The application of analysis of vari- 
ance to continuous data appears to have been an unwise decision. Analysis of 
variance, like chi-square or eta applied to a variable divided in several cate- 


gories, ignores the order of the categories. Consider the following set of means 
in the score M —sum C: 


Economic level 1 2 3 


4 5 6 7 Total 
Mean score 


1.17 1.86 1.29 0.96 —0.75 —0.13 —0.71 0.63 


The downward trend from Group 1 to Group 7 gives great support to the hypoth- 
esis that this score is related to economic level. Analysis of variance estimates 
significance without considering this trend; the same significance estimate would 
be arrived at if Group 2 had had the mean of —0.13 and Group 6 the mean of 
1.86. Davidson might have computed the correlation between each score and 
the economic level, but the skewness of some Rorschach scores weighs against 
this suggestion. The simplest procedure for testing this trend is to split the 
group into a 2 X2 table by combining adjoining categories in the economic scale, 
and dichotomizing the Rorschach score at a convenient point. Chi-square would 


then give the significance estimate. Such a procedure might have yielded signi- 
ficant differences in several instances where Davidson found none. 

In justice to Davidson, it should be repeated that her data have been singled 
out for critical comment because o 


; f their exactness and completeness, rather 
than because they were improperly handled. The foregoing suggestions point 
to ways in which she might have arrived at additional important findings. 


The multiple regression formula. A limitation of check lists is that 
they are simple additive combinations of signs which individually dis- 
criminate. But in such a composit 


e a given trait may enter several 
times if it is reflected in several signs, and thus have greater proportion- 
ate weight then it deserves. The checklist method does not allow for the 
possibility that certain signs may reinforce each other to indicate more 
severe maladjustment than is indicated by a combination of two other 
non-reinforcing signs, or for the possibility that two signs which are 
individually unfavorable may operate to neutralize each other. Multi- 
ple regression and the discriminant function are more powerful proce- 
dures than the usual 


S checklist score, because they consider the intercor- 
relations of scores and weight them accordingly. 


By multiple correlation, one arrives at a regression equation which 
assigns weights to those variables which are correlated with a criterion 
and relatively uncorrelated with each other. This formula may be used 


ai 


an 
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predict or to discriminate between groups. One such formula is that 
of the Air Force, used in its attempt to predict pilot success (21): 


2(Dd+S%) +6 FM+8 W-—1.5 D95--R - (VIII - X90). 


Multiple correlation does not seem especially promising for Ror- 
schach studies. Even such an elaborate formula as that above turns out 
E. have little or no predictive value when applied to a fresh sample. 

zven if it were stable, any formula of this type must assume that 
strength in one component compensates linearly for weakness in another. 


In this formula, emphasis on Dd would cancel weakness in FM, in esti- 
mating a man's pilot aptitude. 


f It is most unlikely that the factors can- 
cel each other in the personality itself. 


The simple linear regression 
formula provides an efficient weighting if the assumption of linear com- 
but interrelations bet 


ween aspects of personality are 
x to be adequately represented in 


pensation is valid, 
this way. The 
on formula is that, W 


probably far too comple 
most that can be said for a regressi 
large samples (and this may require 
diction formula than the simple check! 
yield very accurate predictions if inter 
as complex as Rorschach interpreters claim. 

The discriminant function is a relatively new technique giving a 
formula which will separate two categories of men as thoroughly as pos- 
sible from a mixed sample. It would be used to develop an effective in- 
dex for separating good from poor pilots (not for predicting which man 
will be best, as the regression formula does) or for distinguishing or- 


ganics and feeble-minded. A practical procedure for dealing with mul- 
tiple scores has just been published b se (45), and has not been 


y Penrose 
employed in Rorschach research. It appears likely to have real value 
in studies comparing different types o 


f subjects. . 
Like the regression formula, however, the discriminant function pro- 
vides a set formula. it is assumed tha 


t one factor com- 
pensates for or reinforc n another factor. The interactions 
Within personality are probably too complex to be fully expressed by 
linear or quadratic discriminant functions. 

CORRELATION AND RELIABI 


For one purpose or 
Jationship between 


hen derived on 
5000 cases), it is a more precise pre- 
list score can be. It cannot hope to 
relations within personality are 


LITY 
another several studies 
the several Rorschach 


Scores or between Rorschach scores and external variables. The con- 
ventional procedure for showing that two characteristics are associated 
m to compute a product-moment correlation between the variables. 
This has been done by Kaback (29), Vaughn and Krug (64), and others. 

between variables 


This m z how the full relationship 
e ble to shov E : 
When the cate one on the other is curvilinear. Such a regression 


h Correlations of scores. 
ave tried to show the re 
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often occurs when one variable or both have a sharply skewed distribu- 
tion. In fact, Vaughn and Krug note that one of their plots is curvi- 
linear. The extent to which association may be underestimated is sug- 
gested by the following data. The data used are taken from tests ad- 
ministered individually by Audrey Rieger to several hundred applicants 
for employment, usually for managerial or technical positions. The tests 
were carefully scored by the Beck method. Generalization from the 
data must be limited because the group is not a sample of any clearly 
defined population. For 268 men, the product-moment correlation be- 
tween D and Dd is .735. The curvilinear correlations are nnan, .785; 
"54, .823. There is significant curvilinearity. If D and Dd are normal- 
ized, the regression becomes linear except for the effect of tied scores 
where Dd —0; for the converted scores, r —.767. 

Brower employs rank-difference correlations in comparing certain 
Rorschach scores to physiological measures (3). 
method for small samples and is equally sound for linear and non-linear 
regressions. Thus, a rank-correlation of W/M with another score is the 
Same except for sign as the correlation for the inverted ratio M/W, but 
the product-moment correlations are far different. 

The rank method does have the di 
the small and unreliable differences in the shorter end of skew distribu- 
tions, where many cases have the same rank. This might lower the 
correlations for a score like Fc, but is not a difficulty with scores dis- 
tributed more symmetrically over a wide range, such as F or VIII-X 90. 
Normalizing has the same disadvantage. This is a reflection of the 


inability of the test to discriminate finely among cases in the modal end 
of a severely skewed distribution. 


Reliability coefficients. Test r 


This is a useful 


sadvantage of weighting heavily 


eliability is ordinarily estimated by the 
These methods are not very appropriate 


The split-half method introd 


variable denominators such as W% 


- two sets of inkblots without shock (c 
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pg E varied R. Instead, the standard error of measurement of 
, or W%, should be determined separately for cases where R=10-1 
R=15-25, R 225-35, or some such grouping. » 
"P. reliability of patterns of scores is a difficult problem. If both M 
€ a ere perfectly reliable, any pattern or combination based on the 
21 — ould also be perfectly reliable. But these scores are un- 
Reet jects vary from trial to trial in M or W or both. Nevertheless, 
es = users insist that the pattern" of scores is stable. If there 
E y substance to this claim, it means that certain definable configura- 
ions of the scores are stable even though the separate scores are not. 
The configurations may be as simple as the W/M ratio or may be com- 
plex structures of several scores. One may establish the reliability of 
any composite score by obtaining two separate estimates from in- 


dependent trials of the test. 

The method of determining reliability by independent estimates has rarely 
been used. A study by Kelley, Margulies, and Barrera (30) is of interest, even 
though based on only twelve cases. The Rorschach was given twice, and be- 
tween the trials a single electroshock was given, reportedly sufficient to wipe 
out memory of the first trial without altering the personality. In the records so 


obtained, R shifted as muc ial to trial, and absolute 


h as 50 per cent from tri 
values of some other scores shifted also. In several cases where scores shifted, it 
can be argued that the relationship 


between the scores did not shift and that the 
two records would lead to similar diagno 


ses. The authors made no attempt at 
statistical treatment. Probably this ingenious proced 


U ure will rarely be repeated. 
seful studies could certainly be made, however, by comparing performance on 


f. Swift, 57). Even if the two sets are not 
alent, the data would indicate more about the stability of per- 
hods so far employed. 

set up composite scores, obtain 
hem. Even this is unsuitable 


d out before, a given ratio 


Picty equiv: 
ormance than any meth 


At first glance, it appears logical to 
two separate estimates, and correlate t 


for Rorschach problems, however. As pointed out betor 
such as 20% W or W/M 2.0 has different meaning in different records, 


depending on the absolute value of W. The pattern might conceivably 
be defined by a curvilinear equation, but this becomes unmanageable, 
especially as several variables enter à single pattern. The problem is 
One of defining when two patterns are psychologically similar, and of 


defining the magnitude of the difference when they are not equivalent. 
© one would contend that the W/M balance is unchanged if a subject 
shifts from 12 W: 2M to 60 w: 10 M. The problem is to define and 
Measure the balance in 2 numerical Way. The approach pattern 


W-D-Dd has three dimensions. If we wish to estimate reliability by 
three scores We have a six-dimensional 


Comparing two sets of these $ h 
array, for which no present methods are adequate. So far, even the 
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pattern-tabulation method reduces such data only to four Sum, 
which leaves the problem still unmanageable. All that can be wet 
mended is that additional attention be given to this challenging pro E 
lem. We can now obtain adequate evidence on the stability ofRorschac ; 
patterns only by such a method as Troup's (62), discussed in the first 
section of this paper. It will be recalled that she had two sets of records 


interpreted clinically, and employed blind-matching to show that the 
inferences from the Rorschach remained stable. 


Two unique but entirely unsound studies by Fosberg (15, 16) employed a 
novel procedure to estimate the reliability of the total pattern. He gave the test 
four times, under varied directions. He then compared the four records for 
each person. In one study he used chi-square to show that the psychograms for 
each person corresponded. But this statistical test merely showed that the D 
Score in record 1 is nearer to D in record 2 than it is to W, C, or other scores. 
That is, he showed that the scores were not paired at random. But, since each 


Score has a relatively limited range for all people—i.e., D tends to be large, m 


tends to be small, etc.—he would have also obtained a significantly large chi- 


square if he had applied the same procedure to four records from different per- 
sons. One may also point out that finding a P of .90 does not prove that two 
records do come from the same person, but only that the null hypothesis 18 
tenable, or possibly true. Fosberg's second study, using correlation technique, 
is no sounder than the first. Here the two sets of scores for one person were 
correlated. That is, pairs of values such as W, — W», D, — Ds, etc. were entered 
in the same correlation chart. As before, the generally greater magnitude of D 
causes the two sets to correlate, but high correlations would have been obtained 
if the scores correlated came from two different subjects. 


Objection must also be made to several procedures and inferences of Buhler 
and Lefever (5), in their attempts to d 


proposed Basic Rorschach Score. 
total score, by placing half the sig 
and scoring each person on both li 
halves to indicate reliability. Be 


ems are not experimentally independent. A 
single type of performance enters into a great number of separately scored signs 
(in their checklist, M affects items 1, 2, 5, 6, 7, 8, 10, 11, 12, 51, 52, 53, 86, 93, 
94, 95, 96, 99, 100, 101, and 102). A "chance" variation in M would alter the 


Normal-Schizophrenic key and the Nurse-Schizophrenic key. The evidence !5 
not adequate to show that the weights wo 
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pner dni, Ue hie eee ae 
a 7 e four samples involved had no overlap. (3) 

Certain papers were scored repeatedly, using sets of weigh i m 
parable but slightly different ways (p. 116) E E porta torm 
sets of scores are ome as es ek of aie devas * i er 
rate scorings of the same set of responses is in ii xi = oe 
individual subjects were Leon e solely by m brc peers ol 
correlation when keys havin ' simil: it dee woulo pul pee 
n n keys he g any similarity to each other were applied to the 
a, The reliability of the performance of the subject, and that is what re- 
ility coefficients are supposed to report, cannot be revealed. by rescorings of 

the same performance. 
CONCLUSIONS 

J appended bibliography are convinc- 
ing evidence that Rorschach workers have sought statistical confirma- 
tion for their hypotheses. But the analysis also shows that the studies 
have been open to errors of two types: (1) erroneous procedures have 
led to claims of significance and interpretations which were unwar- 
ranted; and (2) failure to apply the most incisive statistical tests has led 
workers to reject significant relationships. So widespread are errors and 
unhappy choices of statistical procedures that few of the conclusions 
from statistical studies of the Rorschach test can be trusted. A few 
workers have been consistently sound in their statistical approach. But 
some of the most extensive studies and some of the most widely cited 
are riddled with fallacy. If these studies are to form part of the base for 
Psychological science, the data must be reinterpreted. Perhaps ninety 
Der cent of the conclusions so far published as a result of statistical 
Rorschach studies are unsubstantiated—not necessarily false, but based 


on unsound analysis. 


The foregoing analysis and the 


s of statistical rules. The 


ere obvious violation 
nents and introduces prob- 


R Few of the errors w pes 
orschach ic unlike conventional ins ruments é 
lems not carat encountered. Moreover, statistical methods for oe 
tests have not been fully developed (11). It is most eee E . 
research-workers using the Rorschach secure the best em z s ie 
Suidance, and that editors and readers scrutinize stu ies 0 is es 
With great care. But statisticians have a responsibility too, rd 
the logic of Rorschach research and the peculiar character o oer 
tests, in order to sense the limitations of conventional and mathemati- 


cally sound procedures. 
"hore: statistical tools ar 
isable for all studies. Wit 
Sested the following guides to future 
n which a 
terion are esi 


no procedure is equally 


t. And 
w has sug- 


e imperfec d 0 r 
mitations, this revie 


hin these li 
practice. 

clinical synthesis 
pecially appropriate. 


is of each Rorschach 


e Matching procedures i 
rd is compared with a cri 
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2. If ratings are to be treated statistically, it is often advisable to dichoto- 
mize the rating and apply chi-square or bi-serial nol 
3. Common errors which must be avoided in significance tests are: 


a. Use of critical ratio and uncorrected chi-square for unsuitably small 
samples. 


oc 


. Use of sample values in the formula for differences between proportions. 
Use of formulas for independent samples when matched samples are com- 
pared. "M 
Interpretation of P-values without regard for the inflation of probabilities 
when hundreds of significance tests are made or implicitly discarded. 
Acceptance of conclusions when a significant difference is found with a 
hypothesis based on fluctuations in a particular sample. 
4. Counting procedures are in general preferable to additive methods for 
Rorschach data. The most widely useful procedures are chi-square and analysis 


of differences in mean rank. These yield results which are invariant when scores 
are transformed. 


5. Normalizing scores is freq 
involving variance. 


d. 


e. 


juently desirable before making significance tests 


6. Where groups differ in total number of responses, this factor must be 
held constant before other differences can be soundly interpreted. Three devices 
for doing this are: rescoring a fixed number of responses on all papers, construct- 
ing subgroups equated on the number of responses, 
normalized scores (pattern tabulation). 


7. Ratio and difference scores should rarely be used as a basis for statistical 
analysis. Instead, patterns should be defined and statistical comparisons made 
of the frequency of a certain pattern in each group. Use of chi-square with fre- 
quencies of Rorschach “signs” is recommended. 

8. Multiple regression and linear discriminant functions are unlikely to re- 
veal the relationships of Rorschach scores with other variables, since the as- 
sumption of linear compensation is contrary to the test theory. 

9. Rank correlation, curvilinear correlation, or correlation of normalized 
Scores are often more suitable than product-moment correlation. 


10. No entirely suitable method for estimating Rorschach reliability now 
exists. Studies in this area are much needed. 


and analyzing profiles of 


There are in the Rorschach liter 


ature numerous encouraging bits of 
evidence. The question whether 


X the test has any merit seems ade- 
quately answered in the affirmative by studies like those of Troup» 


Judith Krugman, Williams (69), and Munroe. Supplemented as these 
are by the testimony of intelligent clinical users of the test, there 1$ 
every reason to treat the test with respect. One cannot attack the test 
merely because most Rorschach hypotheses are still in a pre-research 
stage. Some of the studies which failed to find relationships might have 
supported Rorschach theory if the analysis had been more perfect: 
How accurate the test is, how particular combinations of scores are t° 
be interpreted, and how to use Rorschach data in making prediction? 
about groups are problems worth considerable effort. With improve 
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3 ents in projective tests, in personality theory, and in the statistical 


ro ERTAN 
seaman for verifying that theory, we can look forward to impressive 
BIBLIOGRAPHY 
1. AngL, T. M. Group Rorschach testing 12. Davipson, HELEN H. Personality and 
in a vocational high school. Ror- economic background. New York: 
" schach Res. Exch., 1945, 9, 178-188. King's Crown Press, 1945. 
. BECK, S. J. Personality structure in 13. Epwarps, A. L. Note on the “correc- 
rage Nerv. and ment. Dis. tion for continuity" in testing the 
> Monogr., 1938, No. 63. significance of the difference be- 
3. Brower, D. The relation between tween correlated proportions. Psy- 
certain Rorschach factors and cardi- chometrika, 1948, 13, 185-187. 
ovascular activity before and after 14. FESTINGER, L. The significance of 
visuo-motor conflict. J. gen. Psy- difference between means without 
\ 3 chol., 1947, 37, 93-95. reference to the frequency distribu- 
4 . Brown, R. R. The effect of morphine tion function. Psychometrika, 1946, 
upon the Rorschach pattern in post- 11, 97-105. 
addicts. Amer. J. Orthopsychiat., 15. FosperG, I. A. Rorschach reactions 
1943, 13, 339-342. under varied instructions. Rorschach 
5. Bunter, C., BUHLER, K., & LEFEVER, Res, Exch., 1938, 3, 12-31. 
D. W. Rorschach standardization 16. Fosnzno, I. A. An experimental study 
studies, Number I. Development of of the reliability of the Rorschach 
the Basic Rerschach Score. Los technique. Rorschach Res. Exch., 
Angeles: C. Buhler, 1948. 1941, 5, 72-84. 
6. Cocuran, W. G. The chi-square cor- 17. FREEMAN, H., Ropyicx, E. H., 
rection for continuity. Jowa St. Col. Suakow, D., & LeBeaux, T. The 
J. Sci., 1942, 16, 421-436. carbohydrate tolerance of mentally 
7. CnowBacu, L. J. The reliability of disturbed soldiers. Psychosom. 
ratio scores. Educ. psychol. Msmt., Med., 1944, 6, 311-317. 
1941, 1, 269-278. 18. Gann, E. Reading difficulty and per- 
8. Cronnacn, L. J. Note on the reliabil- sonality organization. New York: 
ity of ratio scores. Educ. psychol. King's Crown Press, 1945. . 
Y Msmt., 1943, 3, 67-70. 19. GoLDFARB, W. A. A definition and 
9. Cronpacn, L. J. A validation design validation of obsessional trends in 
for personality study. J. consult. the Rorschach examina i OS, 
o, c Z970, 1948, 12, 365-374. en Vac Ress DR n 
- Cronpacn, L. J. Pattern tabulation: ieee -— 
a statistical Los for treatment of 20. GOLDFARB, W. Effects of early mance 
limited patterns of scores with tional care on —— Lager 
particular reference tO the Ror- eget J. Orthopsychiat., s 
aes R Educ, psychol- m 21. GUILFORD, J. P. (Ed.) Printed clas- 
11, CnoNBACH L. J Statistical methods sification tests. AAF epos 
for multi-score tests. Paper Pre Psychology Program edo S. 
TOSES oo m 
ved a Statisti ered q Gus, Avice Estimation of Ror 
| lishe d ecember, 1948. schach scoring categories by means 


428 


of an objective inventory. J- 
Psychol., 1946, 22, 253-260. 
23. Harris, R. E., & CHRISTIANSEN, c. 


Prediction of response to brief 
psychotherapy. J. Psychol., 1916, 
21, 269-284. 


24. Harris, T. M. The use of projective 
techniques in industrial selection. In 
Exploring individual differences, 
American Council on Education 
Studies, Series 1, No. 32, 1948. Pp. 
43-51. 

25. HERTZ, MARGUERITE R. Personality 
patterns in adolescence as portrayed 
by the Rorschach ink-blot method: 
I. The movement factors. J. gen. 
Psychol., 1942, 27, 119-188. 

26. HERTZMAN, M. A comparison of the 
individual and group Rorschach 
tests. Rorschach Res. Exch., 1942, 6, 
89-108. 

27. HERTZMAN, M., & MamcuLiES, H. 
Developmental changes as reflected 
in Rorschach test responses.  J. 
genet, Psyckol., 1943, 62, 189-215. 

28. HertzMan, M., Orvansky, J., & 
Seitz, C. P. Personality organiza- 
tion and anoxia tolerance. Psycho- 
som. Med., 1944, 6, 317-331. 

29. KaBackK, G. R. Vocational personal- 
ities: an application of the Rorschach 
group method. New York: Bureau of 
Publications, Teachers Coll., Colum- 
bia Univ., 1946. 

30. KELLEY, D. M., Mancu.IEs, H., & 
BARRERA, S. E. The stability of the 
Rorschach method as demonstrated 
in electric convulsive therapy cases. 
Rorschach Res. Exch., 5, 1941, 35-43. 

31. KRUGMAN, J. I. A clinical validation 
of the Rorschach with problem 
children. Rorschach Res. Exch., 
1942, 6, 61-70. 

32. KnucMaN, M. Psychosomatic study 
of fifty stuttering children. Amer. J. 
Orthopsychiat., 1946, 16, 127-133. 

33. KURTZ, A. K. A research test of the 
Rorschach test. Personnel Psychol., 


1948, 1, 41-51. 


34 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


LEE J. CRONBACH 


. Leverett, H. M. Table of mean 
deviates for various portions of the 
unit normal distribution. Psycho- 
metrika, 1947, 12, 141-152. 

. LiNpQuisT, E. F. A first course in 
statistics. (Revised Ed.) Boston: 
Houghton Mifilin, 1942. 

McCaxpLzss, B. R. The Rorschach 
as a predictor of academic success. 
J. appl. Psychol; 1949, 33, 43-50. 

McNemar, Q. Note on the sampling 
error of the difference between cor- 
related proportions or percentages. 
Psychometrika, 1947, 12, 153-157. 

Marcuttrs, H, Rorschach responses 
of successful and unsuccessful stu- 
dents. Arch. Psychol, N. Y., No- 
271. New York, 1942. 

MELTZER, H. Personality differences 
between stuttering and non-stutter- 
ing children. J. Psychol., 1944, 17, 
39-59. 

MoxraLro, F. D. An application of 
the Group Rorschach technique to 
the problem of achievement in col- 
lege. J. clin. Psychol., 1946, 2, 254- 
260. 

Munoz, Ruri L. Objective methods 
and the Rorschach blots. Rorschach 
Res. Exch., 1945, 9, 59-73. 

Munroe, Rura L. Prediction of the 
adjustment and academic perform- 
ance of college students by a modi- 
fication of the Rorschach method. 
Appl. Psychol. Monogr., 1945, No- 7 

MuxROE, RuTH L. Rorschach findings 
on college students showing different 
constellations of subscores on the 
A.C.E. J. consult. Psychol., 1946, 
10, 301-316. 

PEATMAN, J. G. Descriptive and san 
pling statistics. New York: Harpe" 
1947. 

PENROSE, L. S. Some notes O 
crimination. Ann. Eugenics, 1 
13, 228-237. 

PIOTROWSKI, Z., CANDEE, D. 
sky, B., HoLrzBERG, S., 
ARNoLD. B. Rorschach sign 


n dis- 


, 


BALIN- 
& VoN 
s in the 


pe 


AT. 


18. 


49. 


51. 


52. 


53. 


54, 


55. 


56. 


57. 


58. 


STATISTICAL 


selection of outstanding young male 
mechanical workers. J. Psychol, 
1944, 18, 131-150. 

Rapaport, D. Diagnostic psychological 
testing, Vol. II. Chicago: Year Book 
Publishers, 1946. 

Ricuarpson, L. H. The personality 
of stutterers. Psychol. Monogr., 
1944, 56, No. 7. 

Rickers-Ovstanxina, M. The Ror- 
schach test as applied to normal and 
schizophrenic subjects. Brit. J. med. 
Psychol., 1938, 17, 227-257. 


. Ross, W. D. The contribution of the 


Rorschach method to clinical diag- 
nosis. J. ment. Sci., 1941, 87, 331- 
348. 

Ross, W. D., Fercuson, G. A» & 
Cuartke, F. C. R. The Group 
Rorschach Test in oflicer selection. 
Bull. Canad. Psychol. Assn 1945, 
84-86. 

Ross, W. D., & Ross, S. 
schach ratings of clinica 
schach Res. Exch., 1944, 8, 1-9. 

Sanpix, T. R., & Mapow, L. W. Pre- 
dicting the depth of hypnosis by 
means of the Rorschach test. Amer. 
J. Orthopsychiat., 1942, 12, 268-271. 

Scmwipr, H. O. Test profiles as 2 
diagnostic aid: the Rorschach. J. 
clin. Psychol., 1945, 1, 222-227. 

SirceL, M. G. The diagnostic and 
prognostic validity of the Rorschach 
test in a child guidance clinic. Amer. 
J. Orthopsychiat., 1948, 18, 119-133. 

SNEDEcor, G. W. Statistical methods. 
Ames, Iowa: Iowa State College 
Press, 1940. 

Swirr, J. W. Reliability 
scoring categories wi 
children. Child Develpm- 
207-216. 

Swirr, J. W. Rorschach responses of 
82 pre-school children. Rorschach 
Res. Exch., 1945, 9, 74-84 


Some Ror- 
value. Ror- 


of Rorschach 


59. 


61. 


62. 


63. 


65. 


66. 


67. 


68. 


69. 


70. 


METHODS APPLIED TO RORSCHACH SCORES 429 


SWINEFORD, F. A table for estimating 
the significance of the difference 
between correlated percentages. 
Psychometrika, 1948, 13, 23-25. 


. THompson, G. M. College grades and 


the Group Rorschach. J. appl. 
Psychol., 1948, 32, 398-407. 

TnonNTON, G. R., & GUILFORD, J. P. 
The reliability and meaning of 
Erlebnistypus scores on the Ror- 
schach test. J. abnorm. soc. 
Psychol., 1936, 31, 324-330. 

Trour, E. A comparative study by 
means of the Rorschach method of 
personality development in twenty 
pairs of identical twins. Genet. 
psychol. Monogr., 1938, 20, 461—556. 

TurcurN, S., & Levy, D. Rorschach 
test differences in a group of Spanish 
and English refugee children. Amer. 
J. Orthopsychiat., 1945, 15, 361-368. 


. VAUGHN, J., & KRUG, O. The analytic 


character of the Rorschach inkblot 
test. Amer. J. Orthopsychiat., 1938, 
8, 220-229. 

Vernon, P. E. The Rorschach ink- 
blot test. Brit. J. med. Psychol., 


1933, 13, 179-200. 
p.E. The matching method 


VERNON, a 
applied to investigations of personal- 
ity. Psychol. Bull., 1936, 33, 149- 
177. 

WALKER, HeLEN M. Elementary 
statistical methods. New York: Holt, 
1943. 


WERN. 3 
brain-injured, mentally defective 
children. Genet. Psychol. Monogr., 


1945, 31, 51-110. . 
wams, M. An experimental study 
of intellectual control under stress 
and associated Rorschach factors. 
J. consult. Psychol., 1947, 11, 21-29. 
yates, F. The analysis of contingency 
tables with groupings based on 
uantitative characters. Biometrika, 


1948, 35, 176-181. 


BOOK REVIEWS 


Hsu, Francis, L. K. Under the ancestors’ shadow: Chinese culture and 
personality. New York: Columbia University Press, 1948. Pp. 
xiv 4-317. $3.75 
Under the Ancestors’ Shadow is an anthropologi+al (or sociological) 

study of "West Town," a small semi-rural community in southwest 

China. The author expresses his conviction that “the essential social 

structure" of West Town is typical of China as a whole, although he ad- 

mits that this judgment can only be confirmed through further study. 
The book is appropriately titled: Hsu's account of West Town cul- 
ture is in some respects far from adequate (in the judgment of a social 
psychologist), but he does a highly effective job of giving substance and 
credibility to his central theme as expressed in the title. Taking up one 
after another aspect of the culture, he elaborates the picture of a people 
who live "under the ancestors’ shadow,”’ and traces the culture patterns 
which flow from this way of life: the cult of ancestors, family unity, sub- 
mission to authority, father-son identification, filial piety, and so on. 
The economic bases of life in West Town are inexpertly trcated. 

Processes of cultural change, though mentioned a number of times in 

passing, are dealt with in perfunctory fashion. But for readers of psy- 

chological background the chief inadequacies of the book stem from the 
fact that the author is first and foremost a careful recorder of what 

Linton has called the overt aspects of the culture, and is not particularly 

skillful in getting at the psychological variables which underlie these 

overt aspects. There are two shortcomings involved: first, he does not 
seem skilled at gathering the kinds of data which would permit infer- 
ences concerning the psychological variables; and second, when he has 
such data available, he does not always interpret them with a sure hand. 
The fact that the book is sub-titled "Chinese Culture and Personality" 
leads the reader to expect a study in the manner of Linton, Kluckhohn: 
or Dubois, rich in materials of interest to the psychologist. It falls short 
of the mark. In dealing with personality, Hsu writes like a man who has 
been handed his conceptual tools after he had completed his field study: 
There is a good deal of internal evidence to indicate that Hsu isa 
conscientious observer and recorder; but on the whole the study canno 
be commended on methodological grounds. Hsu fails to employ the 
various simple forms of controlled or systematic observation which 
would have been readily available to him. His treatment of personality 
structure would have gained immensely from the inclusion of a t° 
autobiographies. He offers little or no information as to his informante 

(The reviewer can see no reason why such ethnological field reporta 5 

this should not include as a matter of course an “‘essay on informa? s. 

no matter how unpretentious: What sorts of data were obtaine erm 

direct observation and what from informants? What kinds of per 
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supplied what kinds of inf i i 
b 1 in ormation? What kinds of person 
est and most reliable sources of information? What sorts amore 


a falsifying appeared in informant reports?) 
n spite of these shortcomings, Under the Ancestors’ Shadow is a use- 


ful additi i 
— to the small but growing shelf of field reports on Chinese 
Joun W. GARDNI 
Carnegie Corporation of New York. TN 
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It has become increasingly apparent over a period of se 
that psychologists, taken in the aggregate; employ the chi-square test 


incorrectly. The number of applications of the test does not seem to be 
increasing, but the number of misapplications does. This paper has 
been prepared in hopes of counteracting the trend. Its specific aims 
are to show the weaknesses in various applications that have been 
made and to set forth clearly the circumstances under which x? can be 
legitimately applied in testing different hypotheses. 

To confirm a general impression that the number of misuses of x* 
has become surprisingly large, & careful survey Was made of all papers 
published in the Journal of Experimental Psychology" during the three 
years 1944, 1945, and 1946. Fourteen papers Were found which contained 
one or more applications of the chi-square test. The applications in only 
three of these papers (1, 15, 18) were judged to be acceptable. In one 
Other paper (21), the several applications could be called "correct in 
principle" but they involved extremely small theoretical frequencies. In 
nine of the fourteen papers (2, 4, 10, 16, 17, 20, 28, 29, 30) the applica- 
tions were clearly unwarranted. In the remaining case,? it was not pos- 
sible to determine what had been done; and the author, when questioned 


twice b 1 ose to reply- 
y letter, did not cho ) in the fourteen papers 


The principal sources of error (or inaccuracy 
f that the psychologists 


elie! 
cal methodologies 


| resulted from a b 


! The choi : i journa 
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Who publish in i d, on the average, 1n statisti 
i etter versed, , Ins ae 
eens probably bee Is. No criticism of the journal nor of individual 
is to illustrate correct 


than are i 
tha wpe ther journals: 
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E his paper, by Pronko (27); 
am Rs an independent judgment re 
Bo uM of x? which were M@C™ gi 
nend in papers published in psychologic 
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just referred to, as well as in papers published in other journals, are as 


follows: 


Lack of independence among the single events or measures? 

Small theoretical frequencies 

Neglect of frequencies of non-occurrence 

Failure to equalize EF, (the sum of the observed frequencies) and 
ZF, (the sum of the theoretical frequencies) 

Indeterminate theoretical frequencies 

Incorrect or questionable categorizing 

Use of non-frequency data 

Incorrect determination of the number of degrees of freedom 
Incorrect computations (including a failure to weight by N when propor- 
tions instead of frequencies are used in the calculations) 


Pane 


sO icum 


These errors will be explained in detail and illustrated with examples 
taken for the most part from books and published papers. 

It is not surprising that errors of the types listed are frequently 
made; several of the standard texts to which psychologists turn for 
statistical guidance contain faulty illustrations. For example, Peters 
and Van Voorhis (26) make four applications of x?, only one of which 
is without flaws; and in one of the applications made by Guilford (13, 
p. 91), there is a failure to equalize XF, and £F; and to calculate the 
number of degrees of freedom correctly. 

A single application made by Peters and Van Voorhis contains the 
first four errors in the above list. Table I is based on their Table 
Table XX XV (26, p. 411). Twelve dice were thrown fourteen times, 
and a record was kept of the number of aces appearing at each throw- 
The observed frequencies F, are entered in the second column of the 
table. A value of x?, given in the last column, was calculated for each 
of the fourteen throws. A composite value of x? was obtained by sum- 
ming the separate values. The first of the four errors in this application 
is that the observed frequencies lack independence. They lack independ- 
ence because the same twelve dice were thrown each time. This means 
that, when the frequencies are grouped, it is impossible to take into 
account the effects of individual differences in the dice and possible 
compensating effects from one die to another. As a consequence, 1° 
statements can be made about the behavior of an individual die, Hos 


3 The term independence, as here used, has reference to individual or single event?: 


In contrast, the hypothesis of independence that is tested by means of x? specifies 4 lack 
of relationship (that is, an absence of interaction) between the variates represented 2. 
contingency table. The events that occur to yield the frequencies of a contingency tab 
must be mutually independent even though the variates are related. 


n 


9 
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11 
12 
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eda orm to generalize the findings to any population of dice fr 

hich the twelve can be considered a sample. Therefore, only h "x 
eses which relate specifically to the twelve dice as a grou p 
tested. More will be said later about this kind of error. PU 
" The second flaw in the application comes from using theoretical 
requencies of 2. These values are too low to yield a quantity whose 


TABLE I 


QUARE TEST BY PETERS AND Van Voonanis (26) 


APPLICATION OF THE CHI-S 


Throw Fo FR (Fo— Fi)? a 

Fi 
1 1 2 i 0.5 
2 3 2 1 0.5 
3 2 2 0 0.0 
4 3 2 1 0.5 
5 1 2 1 0.5 
6 4 2 4 2.0 
7 2 2 0 ae 
8 4 2 4 2.0 
1 2 1 0.5 
0 2 4 2.0 
3 2 1 0.5 
2 2 0 0.0 
3 2 1 0.5 
1 2 1 0.5 

30 28 


? distribution. The third mistake is the 
which are shown in the table as 30 and 


28 respectively. This mistake i$ related to a fourth one—a failure to 
take account of the frequencies of non-occurrence of aces. Any of the 


. à 2 4 
four errors is sufficient to invalidate this use of the x? test. 


distribution approximates the x 


failure to equalize ZF and Fn 


* Tr € ical frequenc! occurrence of aces had been used 
in the ks dee i a n of x2 would have been 12 instead of 10. This 
difference happens not to be large- But in another illustration used by Peters and Van 
Voorhis (26, Table XXXVI, P- 414), the difference islarge. The value of 14,52 is given 
in the EP a AME arail compute f the frequencies of 

rees of freedom remains 


n N i 
the renee the resulting value 1$ ie 
he same, but the calculated value of x* is mo 


ies of non- 
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FUNDAMENTAL THEORY 


The two most basic requirements in any application of the chi- 
square test are (a) independence among the separate measures and 
(b) theoretical frequencies of reasonable size. These requirements can 
be shown in an elementary way by examining a two-category distribu- 
tion of measures. But first, an unequivocal definition of x? is needed. 

If z is a normal deviate in standard form defined in relation to 


population parameters ™ and c, then 
X—m 


g 


and 
san Am : 
=g s= a with df = 1. [1] 
Chi-square with 1 degree of freedom is thus defined as the square of a 
deviation from the population mean divided by the population variance. 
, If there are r independent measures of the variate X, there will be 
r independent values of z, and the resulting formula for x? is 


r 
y-21:u-2 Qum, with df = r. [2] 

i=l 
Values of x? may range from 0 to «, and they have frequency distribu- 
tions which depend upon the value of r. The distribution function 9 
x’, in general form, will be given later. It will then be made clear that 
the chi-square tests of independence and goodness of fit can be applied 
unequivocally only to frequency data (or to proportions derived from 
frequency data). For the present, the plausibility of the two basic 
requirements stated above will be shown through an examination O 

the two-category case. 

The two-category case. Consider a population of N independent events 
(things; measures), each of which may fall either into category A or into 
category B. It is assumed that these categories are clearly defined be- 
fore samples are drawn and that the category in which a given eve? 
falls can be unequivocally determined. A sample is drawn from the 
d the sample data are to be employed in determining 
t a certain hypothesis regarding the proportion of cases 
tenable. If p is the expected (theoretical) proportion 
=1 is distrib" 


population an 
whether or no 
in each category 1$ 
on [1] verifies the statement that the square root of x? with df 


5 Equati 
r Student's D with df=. 


uted as 2 (0 
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ut category A and q the expected proportion for category B, it follows 
Ptg-l 
The ili i i 
Porat itd Med ee ae 
N! 
ni(N — n)! Pu I] 
btaining the successive terms arising 


P(n) = 


puse is the general expression foro 
rom the expansion of the binomial (b+9)*. The limiting form of equa- 


tion [3], as M becomes indefinitely large, is a normal distribution func- 
tion having a mean of Np anda variance of Npg. In symbols, 


© g- (7 ND IBNp, [4] 


1 
lim P(n) = == 
N-- NZ E47] 
Np is the population mean of category A. It is the expected value of 7, 
that is, the theoretical frequency to be associated with category A. 


Nq is the corresponding theoretical frequency to be associated with 
‘As stated in equation [1], 


category B. Mpg is the population variance. . 
the square of a deviation from the population mean divided by the 
population variance is distributed as x" with 1 df. Thus, from equation 
[4], it is seen that the quantity 
n— Np)? 
"pP ( p) [5] 


N40 
Would be distributed exactly as x? with 1 df, provided that N is indefi- 
nitely large. In the two-category case, equation [5] may be employed 
to calculate an approximate value of x’. It should be noted that both 
b and q appear in the denominator of the right-hand term. No restric- 
tion is placed during the calculation; the equation gives an approximate 


Solution for any value ot 4V. 
y y used in the t 


The formula that is common! wo-category case to ob- 


tain a value of x? is 
— Np)? N—n- Ng? 
NI " 
Np Nq 
where Np and Ng are theoretical frequencies and n and (N—7) are the 
robability P(N-—n) that (N —n) events 


_ “If the investigator is concerned with the p 
Will fall into category B, the limiting form of eq 
1 


uation [3] would be written 


— eon Na INTO, 4a] 


——— RE 
= PN = a) = às VNp 
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corresponding observed frequencies. The prime symbol is placed on 
xi in [6] to distinguish between x? as defined by equation [5] and x? as 
ordinarily calculated with formula [6]. When [6] is used, the number of 
df is 1 less than the number of categories because one restriction 
(2F.=N) is imposed on the theoretical frequencies. Therefore, the num- 
ber of df for [6] is 1, just as it is for equation [5]. 

It can readily be shown that x? and x” are identical quantities. It 
is for this reason, and this reason alone, that formula [6] may be used in 
obtaining an estimate of x? in the two-category case." 

The foregoing discussion reveals the two limitations that hold in any 
application of the chi-square test. The first limitation is that x? is cor- 
rectly used only if the N events or measures are independent. Equa- 
tions [3] and [4] are valid statements only when independence exists. 
'The second basic limitation relates to the size of the theoretical fre- 
quencies. If Np (or Nq) remains small as N becomes large, the limiting 
form of the binomial distribution function is not a normal distribution, 
as was assumed in writing equation [4]. If, for any reason, either Np 
or Nq is small, the limiting form of equation [3] is the Poisson distribu- 
tion function. Under such circumstances, the quantity on the right of 
equation [5] would not be the square of a normal deviate divided by the 
population variance and, consequently, would not be distributed as x 
with 1 df. i 

It should be emphasized that the categories are assumed to be desig- 
nated in the population before the individual sample is drawn. It should 
also be emphasized that the equating of the sums of observed and 
theoretical frequencies and the use of the frequency of non-occurrence 
(in this case, the frequency with which measures fall in category I 
are necessary to establish the identity between the quantities defined in 
equation [5] and equation [6]. 


7 If observed and theoretical proportions are used in calculating values of x’, equations 
[5] and [6] become 


peg E RE [5a] 
pie 
or 
= (Yo — go". (5b] 
Pee 
and 
pap a ge. [6a] 
t qt 
In these equations, Nis the total number of cases while p, and gs are the observed nr. 
ectively: 


are the theoretical proportions for categories A and B, resp A 
be derived from [5] and [6], and they reveal that if proportions e 
the values calculated from the proportions must be mu 


tions and pr and qi 
The equations may e 
used instead of frequencies, 


plied by N. 


wh 
^ 


p———— FO. 
———— ——— T | 
——HHNRN 

a 
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_ The more general case. It is common 7 

into several categories instead of just kha ag hear a 
tension of the ideas discussed in the two-category case 52 tei aes 
s. The basic features of this extension will died 
no new ideas enter into the development. The 
ore complex, but the underlying ideas are the 


any number of categorie 
be presented. Actually, 
proof is mathematically m 


same. 
Consider a population of N independent events, with k possible out- 


comes, v 9» v + - - v. Assume that, in the population, 
7, occurs with a probability of pi 
y» occurs with a probability of p2 
v3 occurs with a probability of ps 


rs with a probability of pe 
nts exactly 21 will fall in 
- my in t, is given 


7, OCCU 

P(n;) that out of N eve 
ll in category V2, "s in v3, and + * 
stribution function 


N! 


The joint probability 
category vı, 2 will fa 
by the multinomial di 


[7] 


- ppap sor bs 


“Mk: 


P(n) = 
my nalis! 7 * 


where 


k 
Yu = N. 


iol 

ich the distribution func- 
nt that the measures (fre- 
etc.) of a multidimen- 
e a legitimate ap- 
he table. And be- 


This is the fundamental expression from wh 


tion of x? is derived. It confirms the stateme 
quencies) in the various cells (categories, classes, 
sional table must be mutually independent to enabl 


plication of x? in testing any hypothesis concerning t 


cause equation [7] is written in terms of the frequencies 11, 72 773 * ` ` ng 
the AT is and goodness of fit, based as they 


of independence | 
are on a distribution d from [7], may be used unequivo- 


function Mun : 
cally only in relation to frequency data- . 2 
The distribution function mbolized by g(x), may be 


of x’, here sy 
Written 


[8] 


c oe)? ee, 


gx 
s with 7, is given by 


Where C,, a coefficient which change 


pa "mt ? to frequency 
. * This statement limitin $ pe li dm fi 
faissses eo ding confidence imits fo! 
ak pecial applications suc ariances 

nown sample variance OF rd the end of the paper. 


Sj H . s 
Pecial applications are menti 


data is not meant to exclude cer- 


r a population variance from 
for homogeneity. These 


testing seve! 
oned again towa 


440 DON LEWIS AND C. J. BURKE 


-— — D 
26 r(r/2) 
It should be emphasized that equation (8] is an exact distribution func- 
tion for the quantity defined in equations [2] and [5] but only an ap- 
proximation of the distribution of the quantity defined by equation [7]. 
Its used in relation to equation [7] requires three separate approxima- 
tions, each of which assumes a theoretical frequency of reasonable size. 
The three approximations are: 
1. Replacing each of the factorials in equation [7] by its Stirling approxi- 
mation. 
2. Taking a step similar to the one whereby (1--[X/n])" is replaced by e* 
when 7 is large. 
3. Substituting a continuous integral for a summation of discrete quantities. 


All of these approximations are quite acceptable and lead to inconse- 
quential errors so long as Np is reasonably large. This reaffirms the 
fundamental requirement that the theoretical frequencies must not be 
small, if any calculated value of x? is to be distributed as ect? 


General computational formula. The formula that is commonly em- 
ployed in calculating values of x’ is 


(Fo — F,)? 
x - 
25 = 


Á, [10] 


L This expression for C, contains the gamma function T(r/2). A gamma function is 
a function which reduces to a factorial whenever the argument is an integer. In the 
general case, (n—1)!— T(n). Equation [9] may be written 


Lae [9a] 
20m (— 5) 
2 
actorial in [9a] is an integral number, and its value 
determined in a straight forward manner. On the 
the "factorial" is fractional, and its value must be 
table of the gamma function or by using theequation 


C35 - GNI) (Ewa 


2 


* distribution function for various values of ” 


C,- 


Whenever r is an even number, the f. 
along with the value of C,, can be 
other hand, if r is an odd number, 
determined either by referring toa 


A reader who is interested in plots of the x 
is referred to Lewis (19). 


10 A derivation of equation [8] from equation [7] is presented in considerable detail 
by Greenhood (11) and can be followed by persons familiar with advanced calculus: 
Greenhood's development, which is more complete but similar to one given by Fry 
(8), indicates clearly the limitations which hold in applications of the chi-square test to 
frequency data. It is not necessary, of course, to derive equation [8] from the multi- 
nomial distribution function; it may be derived directly from the joint normal distribu 
tion function in » variables. 
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al, are observed and theoretical frequencies and 
Il cells (categories) of the table. Th 
f categories and reduces to the form of 


[S] 2 the number of categories is 2. It serves as a constant remind 
at the chi-square tests of independence and goodness of fit can be 


applied unequivocally only to frequency data. 


erLICATIONS: I. THE Goopness OF FIT 

or DISTRIBUTION FUNCTIONS 
. chi-square test is i 
ating the hypothesis that a set of frequency ce cg — À 
represented by some specified distribution function. It makes little 
difference what the function is, 80 long as its fundamental properties 
are known. The goodness of fit of binomial, Poisson, and normal distri- 


bution functions is often tested. 
Reo wee case. The correct use of x? in connection with a symmetrical 
i T distribution function (where p =9) can be illustrated with data ob- 
ele ed in a coin-guessing "experiment. A coin was tossed, and 96 students of 
hs mentary psychology each guessed whether the coin came up “heads” or 
tails.” The hypothesis to be tested is that the guess of each student, like 
the fall of the coin itself, was & purely chance occurrence and that each student 
was as likely to say heads as to say tails. The results are shown in Table IT. 


epa F, and Fi, as usu 
the summation extends over a 
equation holds for any number o 


A 


One of the commonest applications of the 


Two-category 


TABLE Il 
DATA FROM A CorN-GUESSING EXPERIMENT 
Hp (= Fo for heads) 68 
H, (=F; for heads) 48 
T. (=Fo for tails) 28 
48 


Ti F, for tails) 


nd third rows; 68 students guessed heads and 28 guessed 
48 and 48. The value of x? is obtained as 


As seen in the first à 
frequencies are 


tails. The theoretical 


follows: 
2 - 28 — 48)* 
m-ar Uf) = (gp — 4x , BO = 16.672 
x- ML pa m t B 
he 0.1% level of confidence; 


With 1 df, this value is significant at better than t 
two-category Case when p=@ 


11 Two alternative formulas that may be used in the 


a 
re as follows: £= (Ho — HQ/H9; 
(Ho = To)? 
x gm amt a 
Ho + To 


ents of the one used above. 


T . 
hese formulas are the exact equival 
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so it must be concluded that the guesses of the 96 students were somehow biased 
ü po d emm of x2, illustrated with two-category data. A fundamental 
eee of x? is indicated by the rule which states that the sum of any ed 
of separate and independent values of x? is distributed as xë, the number o n 
being the sum of the separate df's. The rule will now be applied in relation 
coin-guessing data. Two applications will be made, the first of which is incor. 
rect. This incorrect application is purposely included as a means of era 
how it differs from a correct application and also of revealing the out 
many of the errors made by investigators when employing the chi-square TES 
Ninety-six students of elementary psychology were given five successive 
“trials” in coin guessing. A single coin was tossed five times. After cach toss, 
the 96 students each guessed whether the coin came up “heads” or "tails. 
Each student wrote his five guesses in order on a sheet of paper. The turn of 
the coin was never revealed. The results are summarized in Table III. The 


TABLE III 


Data FROM A SECOND CorN-GuESsING EXPERIMENT 


Tosses 
1 2 3 4 5 
He 68 49 39 54 54 
Hi 48 48 48 48 48 
To 28 47 57 42 42 
T, 48 48 48 48 48 
x 16.67 .04 3.37 1.50 1.50 7223.08 


frequencies for toss 1 are the same as those given in Table II. 
With the ten pairs of observed and theoretical frequencies in Table I 


(two pairs for each toss), it is possible to test five separate hypotheses—that the 
guesses of the 96 students on any one of the five tosses were ch 


ance occurrences 
and were as apt to be heads as tails. 


Five values of x? are given in the bottom 

row of the table. Each of the values was computed with the formula employed 
with the data of Table II. The number of degrees of freedom is 1 in each case- 
For 1 df, the value of x? at the 5% level of confidence is 3.841. Four of the 
five calculated values are less than this and provide no satisfactory basis for 
rejecting the hypothesis of chance occurrence. 

To secure an estimate of x? which makes the probability of rejecting a false 
hypothesis large, it is desirable, when conditions warrant, to summate separate 
estimates of x* and obtain a composite estimate. The number of df for the com 
posite estimate is always the sum of the separate df's. The sum of the yin 
values in Table III is 23.08. The number of df is 5. The hypothesis under 
test is that the guesses of students on five successive tosses of a coin are purely 
chance occurrences, with the probability of a guess of heads (by any studen 
on any toss) equal to the probability of a guess of tails. The composite value © 
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the 0.195 level of confid 

à evel ence, and w 

s if the application of the test were de; 
te the five values of x? in Table III. Th í 
6 students from one toss to the next caiman 


be ass ^ e t 
H, ond E. T - e been independent. In other words, the tabulated values of 
of pibe m E "is toss are interdependent. Consequently, there is no way 
basond os gin unbiased estimate of the theoretical frequency for any toss 
reasonable to st (unless previous guesses are completely ignored). It is un- 
Boyar etr iss li that a student's knowledge of his guess on one toss did 
hatendi E = guess on succeeding tosses. (It will be shown from the data 
assumption R mump Gae unsound.) The x* test should never be based on an 
each etude hat is already known to be false. On a single toss, the guess of 
udent was independent of the guess of the other students; but between 


toss j 
"* the five guesses of each student were undoubtedly interrelated. 
A lack of independence between ts (measures) is the common- 


NE eu separate even 
n the applications of x? that are made by psychologi ix 
e ELE y psy gists. Six of the 


: 7, 28, 29, 30) referred to in the opening para 
contain applications having iode prepis 


There Ó this shortcoming. 
ens. The are two ways in which the responses o! 
illustrati y may be related from trial to trial, as they w 
individual o or they may be internally linked within a le tri 
thei ae a subjects each ma ke more than one response per trial, linkages among 
ea aaue within the trial must result unless there are no individual dif- 
them E Many investigators ignore this restriction and apply the X* test even 
spo gn the same subjects are used from trial to trial and make several re- 
nses on each trial. 


NF... correct use of the ad 
med which separate values 
rated with two-category data. Five 1 


n S: 

ecd. single guess on each of five successiv! z i 
sheet SE per group was 86. Each subject wrote down his guesses 1n order on a 
The paper, the sequence of his guesses remaining on the sheet before him. 
results for the five groups ed in Tables IV (A to E, inclusive). 


Tw i are summariz 
enty-five separate values of x appear i of the five sections 


of tl n the bottom rows 
the table. The question to be answere 


legiti d is: Which of these 25 values may be 
egitimately summaled and which may not be? The five values in any one of the 
fully added for reasons 


hes cannot be meaning already given in the discus- 
ion of the data in Table III. However, the five values in the five sections for 
ated to yield a singl 


edi significant at better than 
P a rejection of the hypothesi 
E" ut it is not correct to summa 

ason is that the responses of the 9 


f subjects may be interdepend- 
ere in the coin-guessing 
single trial. Whenever 


with two-category data. The conditions 
be legitimately summated can be 
on-overlapping groups of subjects 
e tosses of a coin. The number of 


ditive principle 
of x? may 


any single toss may be summ e composite value having 5 
a Ba example, a composite value may be obtained for the third toss and may 
the sed to test the hypothesis that, on the third toss o! coin, the guesses of 
ot Peay pi of the five grouPs were ra ire he probability 
he heads guess equalling that of a tails guess. Neither the presence nor the 
If d nce of individual biases would nullify a meaningful test of this hypothesis. 
gu he hypothesis could not be retained, it would be correct to conclude that the 
€sses were not chance occurrences and were perhaps influenced by what had 
Bore before. The difference between this case and the one illustrated in Table 
Š I is clear-cut. The data in Table III are for five successive guesses by one 
Toup, and the successive theoretical probabilities cannot be established 
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TABLE IV 
Data FROM A THIRD CorN-GUESSING EXPERIMENT 
Tosses 
1 2 E] 4 5 
A. Group 1 
H, 63 56 36 45 41 
a 43 43 43 43 43 
Ts 23 30 50 EU 45 
T, 43 43 43 43 43 
xt 18.60 7.86 2.28 0.19 0.19 
B. Group 2 
Hs 63 56 42 48 45 
Hı 43 43 43 43 43 
T. 23 30 44 38 41 
T, 43 43 43 43 43 
x? 18.60 7.86 0.05 1.16 0.19 
C. Group 3 
H, 65 55 40 48 16 
Hi 43 43 43 43 43 
Th, 21 31 46 38 40 
T, 43 43 43 43 43 
x 22.51 6.70 0.42 1.16 0.42 
D. Group 4 
H, 68 54 38 52 41 
H, 43 43 43 43 43 
vel 18 32 48 34 45 
Ti 43 43 43 43 43 
x 29.07 5.62 1.16 3.77 0.19 
E. Group 5 
Ho 72 57 30 62 38 
Hi 43 43 43 43 43 
T. 14 29 56 24 48 
75 43 43 43 43 43 
g 39.12 9.12 7.86 16.79 1.16 
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without making specific assumptions concerning prior events. In contrast, 
when the five x? values for a single toss are taken from the five sections of 
Table IV and summated, the only assumption that is made is that the guesses 
on that particular toss were in accordance with a theoretical probability for 


heads of .50. 
Composite values of x? for the five tosses are given in Part A, Table V. 


TABLE V 
VALUES or CHI-SQUARE BASED UPON DATA FROM TABLE IV 


Part A. Composite Values of x? 
Toss x? 


Om U N e 


Part B. Values of x? for Combined Frequencies 
Toss x df 


125.17 
36.92 
7.82 
14.88 
.149 


Uno OC à 


The number of df in each case is 5. Except for toss 5, the composite values are 
all significant e better than the 196 level of confidence. The pn toss 
5 falls near the 80% level and lends support for the belief that on a ae ral 
cessive toss of a coin, the probability of a heads guess is .50. The E Em or 
tosses 1 and 2 show that, on these tosses, the guesses were strongly iad p 
ward heads. Conclusions regarding tosses 3 and 4 should bee in t £ light 
of the | ributions made to the composite values for these tosses Dy the 

volens im in Table IV-E, the x* values for tosses 3 and 
guesses of group 5 alone. As seen 1n able i 


ively Ji df in each case, both values are 
4 are 7.86 and 16.79, respectively. wig entis eA eer 


significant at better than the 1% level. à rer iiu 
four parts of the table all fall below the 5% level. However mea ripas 
tosses 3 and 4 are in the same direction for all five groups, an 
Cates i from chance expectations. M 
Ji latioae o omine for each toss separately fhe m 
-D- i 
theoretical frequencies liste -C-D-E, and use the resulting 


d in Table IV-A-B h s | 
Sums to compute values of x^, each with 1 df. yanes of x? obtained uh U 
Way are given in Part B of Table V. Asin the case of = es ien De 
9f the values except the one for toss 5 are significant at d than : P A ave 
9f confidence. Nevertheless, the procedure of et ppm pee. 
Ommended except where the theoretical frequencies for a bl geven igit 
cated Ense eeu are too small to yield satisfactory incivi 
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f freedom is, the 
2, Other things equal, the gaa pes wer cem d rejecting a false 
A SY stable is a value of x? and the greater is the p : 
wo cp of x? may be selected from Table IV-A-B-C-D-E iere in 
gea to test the hypothesis that the guesses of persons on ee eet 
pena of a coin are chance occurrences, with the probability ofa seal the 
equalling that of a tails guess. The hypothesis is inclusive enough to pend 
Aue population from which the groups of subjects were randomly = pve 
To provide for independence between tosses, it is necessary to pepe A oe 
values of x? so that there is one value for each toss and so that M per 
are based on the guesses of a single group. To this end, numbers fro — 
were assigned to the five sections of the table. The x value for toss pss 
taken from the section whose number first appeared in a table of ra on 
numbers; the x? value for toss 2 from the section whose number next appea on 
and so on. The values thus chosen are shown in Table VI. The compos 


TABLE VI 
COMPOSITE VALUE oF CHI-SQUARE, B. 


ASED ON Data IN TaBLE IV 


Tosses p 
pete z 
1 2 3 4 5 
H, 63 54 30 48 46 
H, 43 43 43 43 43 
Ti 23 32 56 38 40 
T, 43 43 43 43 43 
m$ 
—— > _ 5) 8_ OES 
a 18.60 5.62 7.86 1.16 0.42 
Composite x? = 33.66 


value is 33.66, with 5 df. It is hi 
believing that the hypothes 


r 
ghly significant and leaves no grounds » 
The discussion in the p 


is is true. 


15 
empirical results conform " E 
F E ey do conform, it may justifiably be conclude hic 
the penny is "unbiased." The test is an unequivocal one. The extent to W "die 
the investigator should generalize to the collection of pennies from hich as 
single penny was chosen is a matter for Personal judgment. The x? test 
made, reveals nothing concerni 


nny 
> rning the probability that the selected Pe 
either represents or misrepresi 


ents the collection of pennies. of 
Suppose next that two pennies are ran 


dom jon 
H ly selected from a collect! 
pennies and that each penny 1s tossed 50 


E sses: 
times to give a total of 100 to 
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urrence of heads and of tails are combi 

be assumed that the probability of a xà koe b 
applied to test this theoretical probability. 
makes its own unique contribution to the res 
hile the other is unbiased, the obtained value 


of x? could easily be significant and lead to a rejection of the hypothetical 
probability, even though it is correct for one of the pennies. Furthermore, one 
penny could be strongly biased for heads and the other equally strongly biased 
or tails, and the obtained value of x? would turn out to be insignificant. The 
possible presence of individual idiosyncrasies precludes an unequivocal applica- 
tion of x2. The same thing would be true if five pennies were randomly selected 
each one tossed, say 20 times, and the results pooled; or if 10 pennies were 
selected and each tossed 10 times, or 20 selected and each tossed five times. 
Suppose, finally, that 100 pennies are randomly selected from acollection 
le time, and that the number of 


of pennies, that each penny is tossed a sing 
heads is recorded. It may be assumed that the probability of a turn of heads, in 


the population from which the pennies are selected, is 50. The fall of each coin 
is clearly independent of the fall of every other coin. The x? test may be legiti- 
mately applied to determine whether or not the observed frequency of heads 


conforms to the hypothetical frequency. The results of the test can be gen- 
ion of pennies. This would hold even though less 


eralized to the entire collecti 

than 100 pennies were selected, so long as a sufficient number was chosen to 

provide theoretical frequencies of the occurrence of heads and the occurrence of 
2 test. No state- 


tails of sufficient magnitude to warrant an application of the x? tes 
ments can be made, of c encies of any individual penny. 


ourse, regarding the tend 

The crucial point is that frequencies obtained from individuals, whether 
pennies or subjects in psychological experiments, should not be pooled if the 
X! test is to be used, except W 
biases or idiosyncrasies amon 


hen it can be shown tha 

g them? or when “interaction 

specifically under scrutiny- Results on individuals may be combined, but the 
combining should be done after the x? test has been applied to the data on 
individuals separately. For example, if two pennies are each tossed 50 times, 
the x? test may be applied to the results for each penny separately, and then 
the two values of x? may be added to provide a composite value. Similarly, 
separate values of x? may be obtained from the guesses made by two individu- 
als. The separate values d to furnish a single composite 
value. As in the well-known analy: e techniques Lar eere 
of variability contributes to the to e of variability 


each sourc 
should be allowed to make its contri 


2. Unfortunately, 
x? procedures provide no Way: as analysis of variance techniques do, of intro- 
ducing statistical controls over individual subjec 


The frequencies of occ 
the two coins. It may 
x? test cannot be meaningfully 
The reason is that each penny 
sults. If one of them is biased w 


ts as a source of variability. 


2 The y A re sets of frequencies to obtain a single value of x* is 
Miei c iios the uheterogeneity” or "interaction ae be 
data. In this connection, see Snedecor's discussion (31, pP- 191-192) of “pooler * n 
total” chi-squares. F : 
* If an investigator firmly. intends to restrict all generali: 
Sons studied—the group considered in lofo, asa sort of amorph 
of individual frequencies may be logically defended. In = 
analogous to a single individual and must be treated as such. 


zations to the group of per- 
ous mass—then the pooling 
h a situation, the group is 


448 DON LEWIS AND C. J. BURKE 


Therefore, in the use of x°, the control over individuals must be introduced as 
intrinsi of the sampling process. . 
i pics ne case (single dimension). Frequency data sometimes fall Lm 
several categories along a single dimension. If the frequencies from category 
to category are independent and if some hypothesis regarding their decir 
can be meaningfully set up, then the chi-square test may be used to ev aluate 
the hypothesis, provided that the theoretical frequencies for the various p 
gories are of reasonable magnitude. The correct application of the test in A din 
asituation will be illustrated with data obtained in die throwing. A single die 
was thrown 120 times. There was no reason for believing that any throw was 
influenced by any other. The results are given in Table VII, where the first 


TABLE VII 


Data OBTAINED IN Diz THROWING 


F—F) 
Face F, F, em 
1 23 20 45 
2 20 20 .00 
3 22 20 .20 
4 15 20 1.25 
5 18 20 .20 
6 22 20 .20 
p» 120 120 2.30 
Epp 


column lists the six faces of the die and the second column gives the number of 
times that each face turned up. The theoretical probability, on each throw: 
that any specified face of the die would turn up was 1/6. Consequently, the 
theoretical frequency of occurrence of each of the six faces was 20, as shown 
in the third column of the table. A single restriction, the sum of the observe 
frequencies, was placed in calculating the theoretical frequencies. Conse 
quently, there are 5 df. With 5 df, the obtained value of x? falls near the 807 
level of confidence, and there is no basis for rejecting the hypothesis that the 
fall of the die was, on each throw, a strictly chance occurrence. 


^ [t would be possible to use the data in Table VII to test six separate hypotheses 
that the appearance of each of the six faces was a chance occurrence, with a probability 
of 1/6. For example, to test the hypothesis that the appearance of the ace (one-spot) 
was a chance occurrence, a value of x? would be computed as follows: 


— 20}  O7— 
me (23 — 20) 4 ( 100)? 


20 100 = .45 + .09 = .54. 


The number of df is 1. Observe that in the calculation, the frequency of ud 
rence of the ace was taken into account. (The probability of occurrence of the d is 
1/6 [=p] while the probability of its non-occurrence was 5/6 [=q]. This is an exa ae 
of an asymmetrical binomial.) It is clear that six separate values of x? could be comP 


i 
q 
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n of a multi-category frequency distribution (alon 

n the coin-guessing experiment. A e 
ails on each of five successive tosses of 
the ones whose guesses were tabulated 
valuate the hypothesis that chance 


? Another illustratio 
single dimension) comes from results o 
of 439 subjects each guessed heads or t 
a coin. All but 9 of the subjects were 
ri Table IV. The x? test will be applied to e 
actors operated in determining the frequencies of occurrence of the various 
— “patterns” of guesses. With five successive tosses and five successive 

esses, there were 32 possible patterns, as shown in the first column of Table 


VILL. If there was no biasing of the guesses (that is, if the guess of every subject 
ds as to be tails), then each of the 32 pat- 


on every trial was as likely to be hea 

Fn was as probable as any other; and with N —439, the theoretical frequency 
or each pattern was 1/32 x439 213.7, rounded to the first decimal place. This 
except where parentheses 


is the value shown in the third column of the table, 
oes not equal 439 exactly, but equals 438.4. One 


appear. Note that 32 X 13.7 d 

of the requirements in the application of x° is that Æ =} Fe Therefore, six 

of the theoretical frequencies are given as 13.8. These are the theoretical fre- 

quencies corresponding to the six largest observed frequencies. This insures 
ualizing > F. and 27: operates 


that any slight error that may result from eq 
— FP Fi was secured 


to make the test more conservative. 
In the calculation of x? for Table VIII, a value of (Fo 

for each row. These values are gi the fourth column, and their sum 
(=681.122) is the desired estimate. The number of df is (32-1) =31. The 
only restriction placed in figuring the theoretical frequencies was EA, and 
this meant the loss of a single degree of freedom. Even with 31 df, the value 
of x? is so large as to leave no basis whatever for retaining the hypothesis that 
the guesses were chance occurrences. 

ihe orma distribution func 
e t of a normal curve to à 5€ i 
id nn correct except for an occasi f 
: ency to use some theoretical frequencies 


1 "s " " 
mportantly, an incorrect specification of the nu 


he correct procedure will now be illustrated. . l 
ores of 486 students in a course in ele- 
X s of class intervals 


- The distribution of the midterm 5° rte A e mid point 
'ntary psychology is shown in Table IX. The mCP’ j 

of ten e Min, iE given i the X-column, frequencies 1n the F,-column. 

The mean M of the distributio is 104.0, while the standard dengue 

is 1 i nal curve to 

6.1. There are two metho theoretical es that correspond to 

f areas under 


the dat og : 

a (that is, in calculating the RAM 
the , 5 d involves the estimation © 1 
observed frequencies): vap aoe f multiplying ordinate 

urve A cedure. A more 


se; 
va of the normal ¢ Thi an ap 
es by the class interva as This is 2 ; 
ed here) is to obtain L Linie 


exact me 

thod (and the one us ytan 

Proportions taken from à table of the probability integral. 
in the way just indi alues could not then be legitimately summated to 
Yield à dune aper p» They could not be combined because they would 
lack Hidereudene <> as " non-occurrence in each calculation would include 

nce; the 

€ fre à her five faces 
quency of occurrence of the ot r i ilford (13, p. 9D 


© For an example of this me?" eG 


ven in 


s often used in evaluating 
Applications of this type 
ize Fo and Z} Fs a 
ll, and, most 
of freedom. 


tion. The chi-square test i 
t of frequency data. 
‘onal failure to equalize 
that are too sma 
mber of degrees 


ds that 
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The column in Table IX labeled X" gives the upper limits of the various 
score categories. Deviate scores and z scores based on the values of X’ are 


TABLE VIII 


ANALYSIS or CorN-GuEssiNG RESPONSES OF 439 Sunjrcrs* 


R — AL 
Patterns F, F, mne 
Fi 
HHHHH 25 (13.8) 9.089 
BL. BORHOBNOT 12 13.7 0.211 
HOHOBCT OH 22 13.7 5.028 
HHHTT 18 13.7 1.350 
HHTHH 29 (13.8) 16.742 
HHTHT 96 (13.8) 489.626 
HHTTH 22 (13.8) 4.872 
HHT? TT 14 13.7 0.007 
Ho? HH A 5 13,7 5.525 
HTHHP 15 13.7 0.123 
HTHTH 12 13.7 0.211 
ECT fh T oF 4 13.7 6.868 
HTT AR 33 (13.8) 26.713 
HTTHT 17 13.7 0.795 
HT T TR 12 13.7 0.211 
aT TE NX 3 13.7 8.357 
ce a sa a | 3 13.7 8.357 
fT oH HORT 7 13.7 3.277 
THAT 10 13.7 0.999 
THEY 14 13.7 0.007 
T HT Ho 5 13.7 5.525 
THTIET 1 13.7 11.773 
T HT T H 6 13.7 4.328 
(4S HT TOT 0 13.7 13.700 
T TH HA 4 13.7 6.868 
TT ET 6 13.7 4.328 
T TOHOCT OH 25 (13.8) 9.089 
TTRETT 7 13.7 3.277 
yy TT Re 2 13.7 9.992 
TT T HOT 5 13.7 5.525 
T TTTH 2 13,7 9.992 
rT Ee Tt 3 13.7 8.357 
UU Om 439 439.0 x12 681.122 
in 
+ All but nine of the subjects are the same as those whose guesses are analyzed ! 


Table IV. the 
e fourth and fifth columns. Proportions of the total area under he 


in th : í > 
shown 1 e from —® to 3 are given in the P column. Proportions 9 


normal curv 


ST a 


Pea 
d Tson (24), that the number of re 
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a in the se orresponding to the various score interv: are shown 
area gments ct onding th 

d als 
n coim mn P’ and were obtained by taking the differences between the successive 
values of P. The theoretical frequencies came from multiplying the values of 


P’ by N which is 486 in this case. 


TABLE IX 


APPLICA 
ICATION OF THE CHI-SQUARE TEST IN EVALUATING THE FIT OF A NORMAL 
CURVE TO A SET OF FREQUENCY DATA 


Fi (Fo— Fi)? 


Xx Fo Pd x z , 
(-X'-M) (ex/e) P P PN OR 


(+) (1.0000) 


144.5 l - 0141 6.9 
: 35.5 2.195  .9859 .926 
134.5 22 0437 21.2 
— 129.5 25.5 1.574 — .9422 
5 56 .1125 54.7 .031 
"" 119.5 15.5 0.953  .8297 
.8 112 1960 95.3 2.926 
llis x 109.5 5.5 0.342  .6337 
S ifi 2440 — 118.5 .475 
99.5 —4.5  —0.280 .3897 
94.5 94 .2032 98.8 .233 
m go.  -—14.5 0.891  .1865 
S084 1211 58.8 .392 
Jis 79.5  —24.8  -1.512  .0654 
027 .0489 23.8 
i. 69.5  —34.85  —2.133  .0165 
. 7 .0135 6.6 .855 
54 59.5  —44.5 —2.754 .0030 
54.5 2 .0030 1.4, 
(—:) (0000) 


x 486 


f F, are less than 10, they were 
onding values of Fo. 
ue of x2. Seven dif- 


Because the first and the last two values o 
as were the corresp! 


combined with j p 
he adjacent values, 
he sum of : 5 J . oli is the val 
the table ( 5.538) is the 
desi pee y pt Iculations. The number of de- 


erences between F, and Fi entered into the ca of 
8rees of freedom is 7-3 —4. Three degrees were lost because three restrictions 
Were placed in determining the theoretical frequencies. The restrictions were 
uate the view, erroneously attributed to 
d in fitting a normal curve is 1 or 3, 
o test. There is only one 
a normal population. If 


ws t 
Ome statisti xts (6, 25; 26) perpet 
tistics texts (6, "irictions impose 


e . " i ji: t 
due upon the hypothesis that the investigator : a 
thesis open to test—that the frequency data aro 
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the computed values of $:F, M, and c. The hypothesis being tested is that 
the frequency data arose from a normal population. With 4 df, the probability 
of obtaining, by chance, a value of x? greater than 5.538 is around .25; so the 
hypothesis is tenable. 

The Poisson distribution function. 1f the probability of the occurrence of an 
event is quite small, so that Np remains small even though N is relatively large, 
the distribution of observed frequencies in samples of size N may be of the 
Poisson type. The equation for the Poisson distribution may be written 


P(n) E m em [11] 
n! 


where m = Np and e has its conventional meaning. As in equations [3] and [4], 
the symbol P(n) represents the probability of n occurrences out of N possible 
occurrences. The symbol = is used in place of the equal sign to indicate that 
[11] is an approximation formula. The errors introduced by the approximation 


TABLE X 


ANALYSIS OF FREQUENCIES OF OCCURRENCE OF THE CONSONANT "TH" 
IN SAMPLES OF AMERICAN SPEECH 


F-—F) 
á F, P(n) F, (F,— Fi)? m 
0 31 .1868 22.42 73.61 3.283 
1 31 .3138 37.66 44.35 1.178 
2 30 .2636 31.63 2.66 .084 
3 11 .1476 17.71 45.02 2.542 
4 1 .0620 7.44 
5 3 .0209 2.51 
6 3 .0050 .60 41.22 3.896 
>6 .0003 .03] 
M^ 
= 120 1.0000 120.00 x? = 10.983 


are negligible, provided that N is quite large and provided also that N is vety 
much larger than the largest value of n that may reasonably be expected in 
random sampling. M . 

The x? test may be applied in relation to the data of Table X. The colum 
labeled F, gives the number of samples, in a total of 120 short samples ° 


n 
f 


the mean and standard deviation of the fitted function are estimated from the wee 

three restrictions are imposed and 3 df are lost. The same texts give a similar o $ 
retation of the number of restrictions imposed when x? is applied in testing indepe 

P 


ence. 


11 A derivation of this formula is given by Lewis (19, pp. 168-169). 
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American speech, that contained z occurrences of the consonant “th” (as in 
thin). Each of the 120 samples was 400 sounds in length. As seen from the 
table, 31 of the samples did not contain any “th” sounds; 31 samples contained 
one "th" sound each; and so on. As a first step, the data in Table X will be 
ared with some results obtained by Voelker (33). In a study of over 


600,000 sounds occurring in almost 6,000 announcements over the radio, 
"th" sounds to be .0065. Each of the 120 


Voelker found the proportion of 

samples represented in Table X contained 400 sounds. This made a total of 

48,000 sounds. The use of the proportion obtained by Voelker leads to 260 as 
“th” sounds among the 48,000. The 


the predicted, or theoretical, number of 
valuating the hy- 


observed number was 201.15 The x? test may be applied in e 
nt over-all sample, were drawn from a 


pothesis that the sounds in the prese 
general population of American speech sounds which is assumed to be char- 
acterized exactly by the value of m obtained by Voelker. The value of x* is 


comp: 


computed as follows: 

"e om (47,799 — 47,740)" _ 13,388 + 0,073 = 13.461. 

260 47,740 

Note that the observed and theoretical frequencies of non-occurrence of the 

"th" enter into the calculation. With 1 df, the obtained value of x? is signifi- 

cant at better than the 0.1% level of confidence; so there is a firm basis for 
rejecting the hypothesis. 3 i 

The observed proportion of .0042 (=201/48,000) may be used in testing 

the hypothesis that the 120 samples were all drawn from the same Poisson 

distribution. If p, —.0042 and N =400, then m = Np =1.68, and the equation 

for the hypothetical distribution function may be written 


a 68)" e788, 


P(n) = n! 
iven in the third column of 


Values of P(n) computed with this formula are given 1n 
ing values of F, are given 1n the fourth column. 


Table X, and the correspond : I 
As shown in the last column of the table, the value of x? was computed in ac- 
cordance with equation [10]. The last 4 values of Fi and the last 3 valke 15 
were combined to avoid the use of theoretical frequencies of less than 10. 2 
computed value of x? is 10.982. Five differences pues F, aid f bue a 

in the computations. Two restrictions (V and m)!? were poat in "i cu ing 
valuesof A. This leaves 3 df. With this name pothesis may, therefore, Pe 
falls at about the 3% leve bom? ding upon the level of confidence 


tentatively retained or may be rejected, depen 


that has been prescribed. 
HI-SQUARE TEST OF INDEPENDENCE 
? test enables an examination of the 


AprLICATIONS: II. THE c 
mine whether or not the 


A common application of the x 
frequencies of a contingency table to deter 
) 4-6(3) 2201. 


3 1) 4401) +53. =2 
ee por Mire, from the empirical data, but had been 
¢ from some oth: ependent source, the 
E placed wou nd there would have 


8 nF, =0(31) -1 (81) 2 

19 Tf the value of p had no 
taken from the Voelker study or 
Only restriction that would have been 
been 4 df instead of 3 df. 


er completely ind 
ld have been N, a 
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two variables or attributes represented in the table are independent. 
The number of cells in the table may range from four (as in a 2X2 table) 
to an indefinitely large value. The x? test is perhaps most commonly 
applied by psychologists in relation to 2X2 tables. The chief weaknesses 
in such applications are (a) a strong tendency to use excessively small 
theoretical frequencies and (b) an occasional failure to categorize 
adequately. The same two weaknesses are apt to occur when the 
number of categories in either, or both, of the 'dimensions" of the 
table is greater than two. 


Illustrations of the Chi-Square Test of Independence 


Comparison of coin- and die-coin guessing. To obtain data for illustrating 
the x? test of independence, 384 students of psychology were each asked to guess 
heads or tails on five successive tosses of a coin, where the tosses were inter- 
spersed among five throws of a die. The die was thrown; a guess was made as 
to the face that turned up. This guess was written on one edge of a sheet of 
paper. The edge of the paper was then folded under, to hide the guess. The 


TABLE XI 
RESULTS FROM COIN- AND Dir-CoiN GUESSING EXPERIMENTS 
— 
Fo F, 
Patterns z " ; 
Coin Die-Coin Total. Coin Die-Coin 
Guessing Guessing HH Guessing Guessing 
HHHHH 25 29 54 28.8 25.2 
A HHH T 12 9 21 11.2 9.8 
HHHTH 22 9 31 16.5 14.5 
HHHTT 18 5 23 12.3 10.7 
77 52 
HHTHH 29 38 67 35.7 31.3 
HHTHT 96 59 155 82.7 72.3 
HHTTH 22 24 46 24.5 21.5 
HHTTT 14 8 22 11:7 10.3 
161 129 
HTHHH 5 7 12 6.4 5.6 
HTHHT 5 9 24 12.8 11.2 
HTHTH 2 2 14 7.5 6.5 
HTHTT 4 9 13 6.9 eg 


36 27 


-— 
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Patterns 


Sub-totals 


H 
H 
H 
H 


dd44 


dd4d 
HANE 
Haas 
dau 
Aman 


4444 
Haas 
mium m 


Haan 
Han 
aaa 
34mm 
amm 


Sub-totals 


Totals 


H 
H 
H 
H 


sam 


3Jadmm 
SIST 


3a3mau 


F, 
Coin Die: Coin Totals 
Guessing Guessing 


33 
17 
12 


| 


34 


42 


12 
20 


39 


20 


82 


19 
17 


823 


Coin 
Guessing 


438.8 


F 


Die-Coin 
Guessing 


384.2 
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coin was then tossed, the guess being written down. Again the paper was folded 
under, to hide the guess. The die was thrown a second time, the guess made, 
the paper folded under. The coin was then thrown a second time, the guess 
made, the paper folded under. Each of the five guesses on the coin was pre- 
ceded by a throw of the die and a guess on its fall. The paper was folded under 
after each guess on the die and each guess on the coin. Thus, the guesses on the 
coin were not only separated by guesses on the die, but the sequence of guesses 
was hidden from view. The subjects were never informed as to how the coin or 
the die actually fell. 

The frequencies of occurrence of the 32 possible patterns of guesses on the 
five successive tosses of the coin (with guesses on the die ignored) are given in 
the third column of Table XI. The frequencies in the second column of this 
table were copied directly from Table VIII and are based on guesses on five 


successive tosses of a coin alone.?? The subjects for the two conditions of guess- 
ing were completely different. 


TABLE XII 


2X2 CONTINGENCY TABLE BASED UPON THE COIN- AND DIE-COIN 
GUESSING EXPERIMENTS 


First Guess 


H M 
Coin Guessing 339 100 439 
(312.6) (126.4) 
Die-Coin Guessing 247 137 384 
(273.4) (110.6) 
586 237 823 


The first 16 patterns listed in Table XI begin with a guess of heads, the last 
16 with a guess of tails. A 2X2 contingency table, shown in Table XII, was set 
up, the division along one "dimension" being between first-guess-heads a? 
first-guess-tails and along the other "dimension" between the two conditions 9 
guessing. The four sub-totals in Table XI constitute the observed frequencies 
appearing in the four cells of the 2 X2 table. 

The hypothesis to be tested in this case is that the occurrence of a first gues® 
of heads was independent of the condition under which the guessing was don € 
If the observed frequencies in Table XII were independent of the conditions 9, 


2) As will be seen, the 32 patterns in Table XI have been divided into groups wp 
patterns each. The basis for the division will be discussed later. Short horizontal ME 
divide the corresponding observed frequencies. A numeral is placed to the right on - 
of these lines. Each numeral is the sum of the observed frequencies for the correspo 
group of four patterns. Each is a sub-sum and will be used in Table XIII. 
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guessing, the probability of a guess of heads was 586/823. ility 

subject's being in the coin guessing group was bus. o Dr s 
that a subject would be in the coin guessing group and would also guess jeu 
was 439/823 X586/823. The theoretical frequency for the upper left-hand cell 
of the table was obtained by multiplying this joint probability by 823. The 
other three theoretical frequencies were automatically determined by this single 
and by the restrictions of the border sums. The four values 


calculated frequency 
The value of x? was computed as 


of F, are shown in parentheses in the table. 


follows: 
(137 — 110.6)? 


jii (339 — 312.6)? , (100 — 126.4)? (247 — 273.4)? 
110.6 


à 312.6 126.4 273.4 
1 1 1 1 

= (26.4 (—— + — + mou 

0o (zz n5a mat toe 


= 696.96 (.02381) = 165.946. 
The number of df is 1. This follows because there are four cells and because 
three restrictions were placed in determining the theoretical frequencies. The 
three restrictions were the total number of subjects (823) and two border 
sums, one for a row and one for a column. The calculated value of x? is highly 
significant and leads immediately to a rejection of the hypothesis that the con- 
ditions of guessing had no influence on the tendency to guess heads on the first 


guess. 
Table XI may be regarded as a 32 X2 contingency table, 32 “patterns” by 
2 “conditions of guessing.” In order to use the x? test in evaluating the hy- 
pothesis that the patterns were independent of the conditions of guessing, the 
border sums would be used in calculating theoretical frequencies. For example, 
the two theoretical frequencies for the pattern HHHH Hare given by the rela- 
tions: F, =(54 X439) /823 =28.8; and F,=54—28.8 =25.2 = (54 X 384) /823. 
These two values of F: are shown in the top row of the last two columns of 
Table XI. The theoretical frequencies for the other 31 patterns were obtained 
d in the table. These frequencies are included to 
applied to this table 
re less than 


asitstands. The reason is t 
10 (some of them very much | c 
yield quantities distributed as X^ To make a legitimate 
particular case, it would be necessary to increase th 

tical frequenc! 


point where the smallest of the theore 
her or not the observe T 
? test in evaluating the hypothe- 


It must now be decided whet ^ eh 
can be combined so as to permit the use 0 - ot 
sis of non-relationship between the patterning of the guesses and the conditions 

have already been combined in a gross way to 


of guessing. The frequencies crm à 
yield the Mita pet, uencies in Table XII. This division was on the basis 
u . might be made on the basis of the first two guesses 

It is not possible to use the 


of the first guess. Divisions 

Or the first three guesses OF the first four guesses: . 

Patterning on all five guesses because the theoretica : 

Small, as already seen. It turns out that a division on the 
84 and 237, o 


?! The sums 439 and 237 (or 3 ll but 
Stead of 586 and 439. The values of F; forall bu 


1 frequencies become too 
basis of the first four 


r 384 and 586) could have been used in- 
one cell may be obtained by subtraction. 
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and four in the other will be included. A total of 2,274 eighth-grade pupils, en- 
rolled in 91 different schools, took an English Correctness Test. À summary of 
the scores obtained by these pupils has been taken from a report by Lindquist 
(23). The scores were divided into five categories and symbolized by numbers 
from 1 to 5, as seen in Table XIV. The schools were divided into four enroll- 
ment groups, labeled A to D in the table. The observed frequencies in the 20 
cells of the table range from 3 to 342. These frequencies would obviously have 
been different if the enrollment groups had been differently established, and if 
the scores had been divided into different categories. The enrollment grouping 
of the schools was that commonly used in the Iowa Every-Pupil Testing Pro- 
gram.? The division of the scores was made by starting at the bottom an 
"stepping off" successive standard deviation ‘‘distances” (approximately). 
The distribution of scores was positively skewed; so it was necessary to combine 
the two upper score categories to provide satisfactorily large frequencies in 
the top row of cells. 

The theoretical frequencies for Table XIV were computed in the usual way 
when the hypothesis of independence is under test. The computations required 
the use of three column sums and four row sums, as well as the value 
of N(— 2274). This made a total of 8 restrictions; so the number of df =20 -8 
-2122(5—1)(4—1). The value of x? was computed as follows: 


a 06 = 23.5)? (76 — 60.2)? (88 — 83.5? | (75 — 47.3)* 
23.5 60.2 83.5 47.3 


With df=12, this value is highly significant and leads to a rejection of the 
hypothesis that the scores obtained on the test were independent of school size 

A comment should be made concerning the theoretical frequency in the 
upper right-hand cell of the table. Its value is 8.9. Ordinarily, a value this small 
should not be used in obtaining an estimate of x?. In this case, however, t 
other 19 values of F, are satisfactorily large, and the inclusion of one theoretic 
frequency that is less than 10 is permissible since an error in a single category 
will have slight effect on the resulting value of x2. The obtained value of yeo 
large that it makes no difference whether or not the small theoretical frequency 
is included in the calculations. Itis only in situations of this general kind that 
one or two small theoretical frequencies may be retained. When the number ° 
df is less than 4 or 5, and especially when df =1, the use of theoretical frequencies 
of less than 10 should be strictly avoided. 


x = 99.038. 


Use of the Chi-square Test with too Small Theoretical Frequencies 


The studies of Lewis and Franklin (21) and Lewis (20). The commonest 
weakness in applications of the x? test to contingency tables is the use of 
tremely small theoretical frequencies. This weakness is clearly present in mer 
of the applications made in a paper by Lewis and Franklin (21). The paper m 
concerned with the Zeigarnik effect (that is, with the relative amounts of rec 


: h c 
of interrupted and completed tasks). In one experiment, 12 subjects were en 


5 
presented with 18 problems, 9 of which were interrupted by the experime? the 
the other 9 being completed without interruption. The ratio (RI/RC) 9 

D: 


2 The enrollment categories were: A, greater than 400; B, 126400; C, 66-125: 


less than 66. 


— aur 
— — ; 1 
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number of interrupted tasks recalled to the number of 
numb $ : completed task: 
is given for each subject in the top row of Table XV. In a eh 


TABLE XV 


DATA UPON THE "ZEIGARNIK EFFECT” PRESENTED By Lewis (20), 
AxD LEWIS AND FRANKLIN (21) 


Subjects 


Ratio of Recall 
Scores 
1 2 3 4 5 


RI/RC—Group I 
(Indiv. Work 
Condition) 

RI/RC—Group II 
(Coop. Work 

Condition) 1.67 1.25 


1.00 .80 .80 .80 .75 si 63 57 57 44 40 25 


100 1.00 1.00 1.00 1.00 1.00 1.00 100 .80 .75 .60 .14 
k situation in which a co-worker 
nterrupted. The conditions were 
by her and Franklin. Fourteen 
ect is shown in the second 


Lewis (20) had employed a cooperative wor 
completed the tasks on which the subject was i 
otherwise the same as in the later experiment 

subjects were used. The RI/RC ratio for each subj 

row of Table XV. The median ratio for Group I was .67. The ratios for the 
two groups were divided (dichotomized) at this point. The 2x2 contingency 
table shown as Table XVI was the result. The hypothesis to be tested was that 
the conditions of the two experiments had no differential effects on the recall of 
interrupted and completed tasks. The theoretical frequencies are poem in 
parentheses in the table. The investigators calculated a value of x? in i ae 
way. Finding it to be approximately 3.8 and to fall near the 5% level of con- 


fidence incli j he hypothesis. 

, they were inclined to reject the hyp . "e 

, Entirely aside from any of the conclusions reached by Lewis and em 

it must be firmly stated that all four of the theoretical frequencies 1n able VI, 

and especially the two that are less than 5, are too small to warrant aei 

tion o 2 re, the other applications ma e in eir paper, 
f the x? test. Furthermo a aali ar neiesiihat bg 


with one or two possible exceptions, heoretical fr: 
be avoided (her ralital frequencies, for example, of 4.4, 6.0, 7.0, 7.6, etc.). 
TABLE XVI 


7 
BLE BASED upon DATA IN TABLE XV 


CONTINGENCY Ta 


12 18 
Greater than 0.67 M 3) (9.7) 

2 8 
Less than 0.67 an (4.3) 

14 26 


462 DON LEWIS AND C. J. BURKE 


Kuenne's study of transposition behavior. An application, which is correct in 
principle but which must be regarded somewhat unfavorably because of the size 
of the Frs, comes from a paper by Kuenne (18). Kuenne made a study of 
transposition behavior in four groups of young children, the ages for the groups 
being 3, 4, 5, and 6 years. Some of the children displayed size transposition, 
others of them did not. Kuenne realized that, because of the small number of 
cases in each age group, she could not apply the x? test to the data for the four 
groups considered separately. Consequently, she combined the results for ages 
3 and 4, and those for ages 5 and 6. The children were divided into two cate- 
gories—those who did and those who did not meet the transposition criterion. 
The resulting 2 X2 contingency table is shown as Table XVII. The hypothesis 


TABLE XVII 


CONTINGENCY TABLE BASED UPON KUENNE's (18) TRANSPOSITION DATA 


EE ES BES aer um eripe 


Age-Groups Transposition Non-Transposition Total 
3-4 years 3 15 18 
(9.4) (8.6) 
5-6 years 20 6 26 
(13.6) (12.4) 
23 21 44 


to be tested is that the occurrence of transposition behavior was independent of 
age. Theoretical frequencies were determined by using two of the border sums 
and the value of N. It will be seen that two of these frequencies are less than 10. 
Because they are fairly close to 10, many investigators would proceed as Kuenne 
did, and make a x? test of the hypothesis of independence. In fact, there are 
textbooks in statistics which place 5 as the minimum value for theoretical fre- 
quencies. A value of 5 is believed to be too low. In any event, it is the smallest 
value that should be used even when there are several other theoretical fre- 
quencies that are far greater than 10. 

The value of x? computed from Table XVII is 15.434. With 1 df, this value 
falls close to the 0.1% level. It is only because the value is so large that coni 
dence can be placed in the conclusion that transposition behavior was related to 
age. In view of the smallness of all four of the theoretical frequencies, very great 
doubt would have remained if the x? value had fallen at a border-line level © 
confidence. Whenever small theoretical frequencies enter into calculations 9 
x?, the experimenter has no sound basis either for accepting or rejecting a PY- 
pothesis except when the value is quite extreme. - 

Yates’ correction for continuity. No mention has as yet been made ofa Sor 
rection proposed by Yates (30) which reduces the value of x? to compensate ing 
errors which may arise as a result of one of the approximations made in deriv! ec 
the formula for the x? distribution. It will be recalled that three approximatio 
are made in this derivation. One of the three involves the substitution © 
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on of discrete quantities. This approximation introduces 
ia on) that is of consequence when values of F, 
mall. The correcti when the number of df is 1.5 It 


provides for the reduction of all differences between observed and theoretical 
KEEN by 0.5. For example, all of the differences between observed and 
vM frequencies in Table XVII are 6.4. These are reduced to 5.9 if 
ates’ correction is applied. The calculation, using the correction, would be as 


follows: 


integral for a summati 


an error (an error of overestimati 
ion is justified only 


1 1 1 1 
2 6.08 (see tase tag) 7 62069169 = 
a= tt (sitse tie ma (34.81.3768) = 13.1. 
un 1 df, this value is still highly significa 
ypothesis. But the use of Yates' correction 
theoretical frequencies that are less than 10. 
rect Categorising 


nd completed tasks. Another weakness 
2 test to contingency tables is 


nt and leads to a rejection of the 
does not remove the objection to 


Questionable or Incor 


Lewis’ (20) study of recall of interrupted a 


TEM is sometimes present in applications of the x 
that the categorizing is done on either a questionable or à clearly incorrect 


basis. An illustration of incorrect categorizing is found in the paper by Lewis 
(20) discussed above. The RI/RC ratios for 14 subjects were obtained in a 

cooperative work experiment.” The ratios are the ones given in the second 
row of Table XV. On the assumption that the recall of interrupted and com- 
pleted tasks should have been the same, Lewis writes: ^... We should have 


an equal number of ratios above 1.00 and below 1.00. . .. The expected distribu- 
tion of ratios should, therefore, 7 at 1.00 or above. The 


be 7 below 1,00 and 
ahteined O ron of ratiós is below: 1:09 and 10 at 1.00 or above.” The 
categorizing is plainly wrong; there is no more reason for placing an obtained 


ratio of 1.00 in the upper category than for placing it i the lower category. 
better procedure AA have been to ios of 1.00 equally between 


I divide the 8 ratios 0 a 
the two categories. This flaw in Lewis remindful of the 


4 division of the ratios 1° ful 
belief of many graduate students in psychology that it is quite A ee © 
Set up several different se Pasi 


ts of dichotomy 1 n Ed 
Set, and finally select the dich ield a x* value to support the experi- 


otomies that y! ; lue tone x2 test e apes 
menter" i new. investigation whe x : 
e etal shaft lished in à logically defensible and reliable 
manner—befo ta are collected, if possible. : 
isdem eh ae dte oer a ey 
1e whole pr: tegorizing m ought í : 
taking an poen ue a a study bY Anastasi and Foley (1). These two 
investigators had auk uf H nd 340 abnormal Subject r 
ptum which depicted danger 2 n divided into the 
ubject-matter categories listed 1" Table X Dd 
Yields a value of 99.603 which, wi 19 df, falls far DEP che subject 
Confidence. This leads to a rejectio [ the hypothesis eid the subjec matter 
E "TH ” 
of the drawings was independent © kinds” of subjects. 
:£ several values of x? are to be summated. 


23 
The correction should no b alues. 


Ti n 
he additive principle does not apply 
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TABLE XVIII 


DATA FROM ANASTASI AND FOLEy’s (1) STUDY or DRAWINGS OF 
NORMAL AND ABNORMAL SUBJECTS 


F, F, : 


Subject-Matter Categories (Abnormcl) (Normal) 


1. Traffic 105 124 114.5 
2. Conventional sign or signal 36 37 36.5 
3. Skating, ice 5 10 745 
4. Falling 21 17 19.0 
5. Drowning, sinking, flood 8 10 9.0 
6. Falling objects, explosion 6 6 6.0 
7. Arms and explosives 10 22 16.0 
8. War 3 7 5.0 
9. Fire 26 37 31.5 
10. Lightning, electricity 6 15 10.5 
11. Animals 3 16 9.5 
12. Abstract or conventionalized 
symbolism 3 8 5.5 2.273 
13. Fantastic compositions 26 0 13.0 26.000 
14. Several discrete objects 9 2 5.5 4.454 
15. Scribbling or scrawl 6 0 3.0 6.000 
16. Writing only 12 1 6.5 9.308 
17. Miscellaneous 23 26 24.5 .184 
18. Recognizable object not rep- 
resenting danger 8 1 4.5 5.444 
19. Refusal to draw 14 1 1.8 11.267 
20. No data for other reasons 10 0 5.0 10.000 
eil -— E Mss" 
340 340 99.603 


Let it again be emphasized that the criticism here, as elsewhere in the pape 
is not directed at any of the conclusions reached by the investigators. But the 
illustration provides a very satisfactory basis for discussing the fundament@ 
problem of categorizing. In the published article, the principles adopted 1" 
classifying the pictures are not explicitly stated. Furthermore, evidence is n 
presented regarding the reliability of the categories. Two generalizations may 
be offered. The first is that, whenever possible, categories for frequency data shoul 
be established on the basis of completely external criteria (for exa mple, criteria tha 
have been used or proposed by some other investigator) and should be set op 
independently of the data under study. Such a procedure frees a person aee 
any charge of bias and guards against tendencies to juggle data. A second £^ 
eralization is that information on the reliability of categories should. be offero 
and this is the case whether or not the categories have stemmed from an !? 
pendent source. the 

A study of Table XVIII shows rather quickly that the value of x? for ing 
drawings depicting danger would have been quite different if the catego!” 
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had been different. For example, the amount i 

value of x? by the frequencies of category 13 — Beim eae a be id 
! The decision which established this category Bude 
f the drawings of the abnormal subjects in the 
of the normal subjects in it, should have been 
atement concerning reliability should have 


been included. The discrepancy between the frequencies in category 13 (alon 
with the discrepancies in such categories as “refusal to draw" and “no data for 
other reasons") required that there be discrepancies in one or more of the other 
categories. Unreliability at one point in a multicelled table automatically pro- 
duces unreliability elsewhere. 

It is well to emphasize, by reiteration, that when the x? test is to be applied 
to a collection of data, the categories should be established independently of the 
data and, once established, should never be modified on the basis of the way 
the data happen to fall. Categories should usually, if not always, be established 
before the data have been scrutinized. 
Goopness OF Fit oF FUNCTIONS IN 
NT VARIABLE 


‘fantastic compositions.’ 
judgments which placed 26 o 
category and none of the drawings 
explicitly justified and a precise st 


APPLICATIONS: II]. THE 
WHICH FREQUENCY Is THE DEPENDE 


tive frequency) of occurrence of a response is 
dent variable in psychological experiments. 
vestigations based on the method of 
oportion of judgments in a given 
while in studies of the con- 
ce of the CR is often taken 
is fitted to data 
t in evaluating 
rect 


The frequency (or rela 
sometimes used as the depen 
For example, in psychophysical in 
constant stimuli, the number or pr 
direction serves as the dependent variable, 
ditioned response, the frequency of occurren 
as the dependent variable. If a mathematical function 
of this type, it is sometimes possible to apply the x? tes 
the goodness of the fit. However, care must be taken to insure a cor! 
application. 

mmon practice 


mma function. It is a fairly co 
tween observed 


Goodness of fit of the phi-ga 1 
among psychophysicists to apply the x? test to the differences be à 
and theoretical proportions in order to evaluate the phi-gamma hypothesis— 
the hypothesis that the observed proportions can be represented satisfactorily 
by the phi-gamma function.” The usual procedure in making the test will be 
illustrated with some weight-lifting data. 

As part of a laboratory exercise, à ET 


mental ade judgments on € 
course made 100 Judg the X column of Ta 


duate student in an advanced experi- 
ach of nine pairs of weights. The 
ble XIX. The method of 


Wweigl H * ‘ 
ghts, in grams, are listed in The 100-gram weight was the standard and 


Constant stimuli x 

stin loyed. : n 

Was paired ele ue EE yaly with itself, but with each of the other eight 

à si ‘dard weight was always lifted first, the variable 
erms of the question: 


'variable") weights. The stan 

Weight sec T 00 jud ts was made in t 

T ond. Each of the 900 ju gmen 

hich weight is ix delis the first or the second? The second column of the 
explained by 


Guil The use of the phi-gamma function in psychophysical research is 
ilford (13, Chap. VI). 
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table, labelled i 
t y p» gives the proportion of ti i 
"1 k 
ud ms than the standard weight BoSS CUE eae QE 
i-gamma function may be fi t 

"me j nay be tted to the proporti i x 

oe | Mere then be applied (with a reservation to pei: eon ee 
ness of fit. The phi-gamma function is basically the = ad 

) e 


equati i i 
quation for the normal ogive, which may be written 


z 1 
"T tit 
witl b m i * 
1 p standing for proporti 
Ac à proportion of area under the normal curve f - : 
sual, x represents a deviation of the variate X from the sois de x. pe 
ymbol + for the quantity 1/c 4/2 yields 


substitution in equation [13] of the s 
[14] 


"E 

- Le 

= GE e? dx. 

n th 1 * H p . 

Sarion hd x =X—L, where Lis the value of X corresponding to a pro- 

ceive on «50. The constant his an index of the steepness or “precision” of the 
rve. The phi-gamma function, written directly from [14], takes the 


form 
LER 
paf yee 115) 
My g 
sig y has been substituted for hx. 
ii&c n fitting the phi-gamma function to a set of observed proportions, it is 
Sue oe to obtain estimates of the constants + and L. Since y=/« and 
X —L, it follows that 
a4 - AX — hL. [16] 
X. The principle of least-squares is invoked in 
hand L may be 


Thi — 

iesus is linear in y and 

ae following two equations with which estimates o 
xXxX-XExLX. u 


NEXE? 
he We (LX 


xxi-Xàr 
Bei t 
Nh 
the quantity that i 


Y (y — AX + AD* 
and [18] can be solved, values 0 


table of the phi-gamma function 
[15] for values of p. The ta 


18 


I . $ areas 
n the derivation of these equations s minimized is 
E- qy) = 

Before equations [17] f y must be obtained. 
hey may be found in a (that is, in a table 
ased on solutions of equation ble is entered with 


Values of p,. 
eight is significantly greater 


fine The reason that the value of po for the 100-gram w m 
sist -50 is that a negative time error was operating. This means that there was a con- 
ent tendency for the subject to over-estimate the weight of the second member ofa 

all comparisons, but was most immediately apparent 


Pair, Th 
E. e tendency was present in BER. 
Tom the value of p when the standard weight was compared with itself. 

il d values of $ and y) are not 


ST 3 ipe: j- 
abl e phi- tion which give paire 
les of the phi-gamma functi ( Roa — 


u; H " 
Sually included in statistics books published in the United 
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Values of y corresponding to the nine values of p, in Table XIX are given in 
the third column of the table. The fourth column gives values of Xy, the fifth 
column values of X7. The four sums needed for solving equations [17] and [18] 
appear at the bottom of the table. Estimates of constants / and L are computed 
as follows: 

2 i 
p = 62.10 — 267869. o 152; 
9(83,184) — (864)? 
L= (864)(.152) — (2677) = 95.8. 
9(.152) 


The insertion of these estimates into equation [16] gives 
yı = 0.152X — 14.562. 


This equation was used to calculate the values of y, shown in Table XIX. The 
corresponding values of p, were read from a table of the phi-gamma function. 

The x? test may be applied (with a reservation) to test the goodness of fit, 
that is, to test the hypothesis that the difference between the observed and 
theoretical proportions arose from sampling fluctuations and, consequently, that 
the observed proportions may be satisfactorily represented by the phi-gamma 
function. The procedure for calculating the value of x? based on all nine theo- 
retical proportions is illustrated in the two columns of Table XIX denoted by 
(A). Values of (p.—p.)? were first obtained and were then divided by the 
product pg: This single division by pig: took account of differences between 
b. and f, as well as differences between q, and qı (where, as usual, qo =1—Pe 
and q:=1—p,). It can easily be shown that 


(be — pi)? a (qo — 9) _ (Po — pr)? 
hi qi i Pai i 
Emphasis is here given again to the fact that in the calculation of x?, account 
must be taken of the frequency of occurrence and the frequency of non-occur- 
rence of an event. 
The sum of the second column under (A) is 021767. This is not the value 
of x*. However, it becomes the value of x? after it has been weighted by 100, 


the i n of judgments upon which each value of p, was based. In other 
words, 


(bo — pu)? 
z Pige 


where 7, is the number of judgments per proportion. The number of df is 
7=9-—2. It is two less than the number of rows in the table.? The compute 


=n 


= 100 (.021767) = 2.1767 


to obtain values of y indirectly from an ordinary table of the probability integral. 
Since z=x/o; y=hx, and h=1/ev2, it follows that y=.70712 and z —1.4142y. 

21 The number of degrees of freedom for a table such as XIX is readily determined bY 
starting with the number of rows. Each row contributes 2 df before any restrictions are 
imposed. (This is because each row actually covers two pairs of proportions. The pair 
which is omitted in the formula is g; and gs.) The restriction that (p:-+q:) —1.00 i5 im 
posed on each row; so 1 df is lost for each. Further, 1 df is lost for each constant estimate 
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value of x? falls near the 95% level of confidence; so the phi-gamma hypothesis 


cannot be rejected. 

The applicability of the 
the x? test in the manner illu 
tion, one relating to the size o 
relating to the restrictions imp 
are considered, a few comments will be m 
that were imposed, although it must be sai 


restrictions is mathematically rather complex 
paper. Whenever a curve is fitted to a set of empirical data, there are usually 


several acceptable methods of determining values for the constants to be in- 
serted in the equation. Itis relatively easy to show that some of these methods 
lead to values of "x5," as computed by the familiar formula, which tend to be 
consistently larger or smaller than the values obtained when other curve fitting 
methods are employed. Therefore, when a value of x? is computed to test the 
goodness of fit of such functions as the phi-gamma function, the method used 
to estimate the constants becomes à matter of importance. The computed 
quantity is distributed as x? only when the restrictions imposed during curve 
fitting are both //near and homogeneous. (The meaning of these terms is ex- 
plained more fully in a later section.) In many instances of curve fitting, and 
the present one is a case in point, the restrictions that are imposed 90 not e 
the requisite properties. When this is true, the investigator must be especially 


cautious in interpreting his results. : " 
Se ee unde at now be given bs thé ger eden A9 i 
resented by the theoretical proportions 1n Table XIX. : bed 4.0 "These 
the first row corresponds to two theoretical frequencies: A m E nae - 
values are estimates of the number of judgments game o petes e 
judgments "lighter" that the subject should have made w Pole cna 
104-gram weight with the standard weight. T! 4.7 d 95 3. These fre- 
row also corresponds to theoretical frequencies of, + a fect ghould have 
quencies indicate that, according to the fitted function, the su (a the bearem 
said “heavier” 4.7 times and "lighter" 95.3 times veg ETATS too small to 
weight with the standard. The two frequencies of a been used in the calcula- 
fen quantities distributed as a and eba sas a means Monroy ihe 
ions. They were included 1n i: e proportions should 
Correct with the incorrect procedure. ed ciple XIX Mns the results 


have been combined with adjacent p I sed on .97 and .90, d 
under the general head- 


last statement rests upon the validity of applying 
strated. There are two weaknesses in the applica- 
f two of the theoretical frequencies and the other 
osed during curve fitting. Before the frequencies 
ade on the nature of the restrictions 
d that a meaningful explanation of 
and lies beyond the scope of this 


after the combining has been done. N | 
x E abeled po 
also on .13 and 04, are shown es aem pam are given in the column labeled 
heoretica’ Pr). and .085) are averages based on 200 


ing (B). Corresponding t 

Pi. The two combined values of Po Epp calculated for each must be 
judgments each and the quantity Y cause of this differential weighting, it is 
e. du of the table, before the 


weighted by 200 instead a shown in the last column 


better to multiply by #* 2 
o constants (/t and L) must be 


5 rocess. Since tw : 
sapie n the number of rows (or the number of pairs 
is 2 les + 


Iculations. 


from the data during the cu 


estimated, the number of df 
of 5, and p.) that enter into the ca 
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The study was designed to evaluate the theory of the — —— 

ditory discrimination. The theory required, among other thing L4 io De 
ub rs 4 oportions for a subject could not be adequately represented bi bes 
erwies uii o. Consequently, phi-gamma functions were fitted poem 
^ra six sets of proportions in Table X X. All proportions giuste ee bt 
less than .03 were omitted during the curve fitting Process. A faw : - pitfalls 
portions later played a part in the calculations of x?. This is gne si UE m 
to be carefully avoided. Another important aspect of the € ürve: 1 at daw 
cedure was that the Müller-Urban weights were utilized. The eere EL 
weights and the detailed procedures for treating weighted data ara ac ea aged 
explained by Guilford (13) and need not be discussed here. It is a re 
that the weights are designed to diminish the influence of extreme proj 
in the computations of the constants h and L. 


TABLE XXI 


T : " TE ) 
VALUES OF h AND L SECURED ny Srey INS, MORGAN AND VOLKMANN (32 


Subject h L " 
JV 0.8520 3.9120 
GS 1.2098 3.0525 
RR 0.8476 4.1051 
JM 1.0640 2.9001 
DM 0.8754 3.5885 
MJ 2.0432 1.6194 


fhand L for the six fitte 


were used in the m 
theoretical Proportions for e 


proportions used by the inv 
he six calcula 


d functions are Shown in Table XXI. 
anner illustrated in Table XIX to altan 
ach set of observed Proportions. The ile 
estigators in calculating values of x? are shown A 
ted values of X! are given in the first row of Tab 


required that the Proportions for the sev vil 

ht-line functions, The results for the straight-line fits V 
Proportions in Table XX 
The x? test could then be 


ner exactly an 


alogous to th 
th phi-gam i 


at employed when theoretical prope 
Whenever a comparison is to be ia be 
between the goodness of fi o different functions, the same quantity should ly. 
minimized in the Process of obtaining the constants for the functions. In their stut b- 
Stevens, Morgan, and Volkmann min; sum of squared differences between 9 
served and theoretical values o i ing phi-gamma functions but epe 
the sum of squared differences d and theoretical proportions in n 
iven by Cramér (5, p. 426 ff.), the ym 
ould be expected to be larger than t 


ng toa theorem g 


mma functions w 
obtained with the straight lines, 


o n 
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XXII, along with the number of df for each. (The other two rows of values į 

this table will be explained later.) Only one of the values in the ees rt ie 
value for subject RR) is sufficiently large to justify the rejection of E tx 
gamma hypothesis at better than the 5% level of confidence. However. t ; i 
separate values of x? are completely independent estimates and, theretore ; $ 
be summated to provide a single composite value of x?, the number of if fo. 
Which is the sum of the six separate df's. The composite value appears hie 


TABLE XXII 


VALUES or CHI-SQUARE BASED UPON THE Data OF STEVENS, MORGAN 
AND VOLKMANN (32) 


Subjects 


JV GS RR JM DM MJ Comdariw 
x* 7.64 4.02 10.35 2.97 5.90 2.92 33.80 
df 5 3 3 3 3 21 
x? 1.94 0.13 4.06 2.97 5.90 1.02 19.02 
df 1 2 3 3 3 2 17 
xt 1.70 0.13 3.14 2.87 2.90 1.02 11.76 
df 2 1 2 2 2 10 


TABLE XXIII 


Snowing ri EFFECTS UPON CHI-SQUARE VALUES OF USING OBSERVED PROPORTIONS 
Wuicu WERE EXCLUDED IN THE DERIVATION OF THEORETICAL 
PROPORTIONS (DATA FROM TABLE XX) 


(4) (B) (C) 
: Q—po (po— Po)? (b. — b0* 
P ğ a — bi)? (bo= Ps)? MO np E 
b hi (bo — bi ^ p Po pt k pos 
3.0 (.02)  .093  .005329 .0629 = = = 
3.5. .21 — .234  .000576 -0032 .0032 — .21  .234 .32 
4.0  .52 .450  .004900  .0201 .0201 52 — .450 2.01 
4.5  .64 .682  .001764 -0081 .0081 64 — .682 8 
5.0.81 — .858  .000324  .0027 0027 
; : snos ep s .905 .905 
5.5.97  .953  .000289 .0065 .0065 5S us .00 
.1035 .0406 3.14 


From Column A: x? 2100 (.1035) =10.35, with 4 df 
From Column B: x?— 100 (.0406) = 4.06, with 3 df 
From Column C: x? 3.14, with 2 df. 


— 
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3 z esína fuil ad t 
last column of the table and is 33.80, with 21 df. kr vum han ni 
the 3% level of confidence and indicates that the fits of the phi-g 
sig ne corn this application was that four of the extreme ahietesd 
proportions, no one of which played any part in determining constants h e 
were included during the calculations of the values of x?. A specific — x 
clarify the point. The observed and theoretical proportions for subject RK = 
listed in the second and third columns of Table XXIII. The observed prop à 
tion .02, shown in parentheses, was purposely excluded by the investigators 
when making least-squares solutions for and L. Once obtained, these zik 
stants were crucial in determining the theoretical proportions. Che investi- 
gators decided to include all observed proportions whose corresponding theo- 
retical proportions were neither greater than .97 nor less than .03. This ment 
including the observed proportion .02 in the calculations for Table XXIII. It 
also meant the inclusion of observed proportions 1.00, .00, and .99 in the calcula- 
tions for subjects JV, GS, and MJ, respectively. n 
As shown in Table XXIII, the estimated value of X? was 10.35 when p, = 02 
and p: =.093 were included in the calculation. If this pair of proportions is not 
included, the estimated value is 4.06. The numbe 


r of df for the first estimate is 
4, for the second, 3. There c 


an be no doubt that the second estimate is the bet- 


ter of the two. In the calculation of xX’, itis incorrect to use observed proportions 


which have not been allowed to influence the magnitude of the theoretical pro- 
portions. 


The values of x? shown in the second row of Table XXII were obtained by 
excluding the observed proportions which played no role in the curve-fitting 
process. There are still six independent values, each with its own number of df. 
It is entirely legitimate to add these values to obt j 
19.02 (with 17 df) given in the last column of the table. This new composite 


value falls at about the 3097 level and provides no basis for rejecting the phi- 
gamma hypothesis for the six sets of proportions. 


Still another flaw in the calculations of the values in Table XXII was that 
theoretical proportions representing theoretical frequencies of less than 10 were 
not first combined with adjacent theoretical proportions. 
theoretical proportion .953 (in Table XXIII) represents theo 


of 95.3 and 4.7. A theoretical frequency of 4.7 is too small to yield a quantity 
distributed as x?. Therefore, theoretical proportion .953 should have bee" 
combined, as shown in Table XXIII, with the theoretical proportion .858 t9 
give 5, —.905. The modified value of x? is 3.14, with 2 df. This value appears fo! 
subject RR in the third row of Table XXII. Similarly modified values for ihre 
other subjects are included in the same row. Again there are six independen 
estimates of x?, and these may be added to obtain the single composite € 
of 11.76, listed in the last column of the table. With 10 df, this estimate fa 
around the 30% level. 


ain the composite value o 


For example, the 
retical frequencies 


It will be realized that the elimination of the small and 1 3 
makes impossible a really critical evaluation of the phi-gamma hypothesi 
through the use of the x? test. But this is not a fault of x?; it is a weakness pd 
the experimental data. Whenever x? is to be employed, the experimenter MU 
take precautions to insure theoretical frequencies of adequate size. —9" 

The principal pitfalls in the use of x? with proportions have been designa 


arge proportion? 


4 
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Two others should be mentioned—the tendency to divide (p.—p:)* by p: alone 
instead of by the product Pig: without including frequencies of non-occurrence 
and the tendency to neglect to weight by the number of judgments upon which 
the proportions are based. 

_ An application of x? to percentages (Grant and Norris, 10). A recent applica- 
tion of the x? test to percentages (or relative frequencies) is taken from a paper 
by Grant and Norris (10). These investigators were concerned with the a 
fluence of different atnounts of dark-adaptation on the sensitization of the beta- 
response of the human eyelid to light.” One of their measures of degree of 
sensitization was the frequency of occurrence of the response. A subject looked 
straight ahead into a small box-like enclosure which was painted flat black in- 
side and out. The stimulus was a small circle of light emitted from a circular 
milk-glass plate, 10 cm. in diameter, located at the back of the enclosure. When 
illuminated, the plate had a surface brightness of 241 millilamberts. The dura- 
tion of the stimulus was about 730 milliseconds. 

Four experimental conditions were employed, all subjects serving in each 
condition. The conditions differed in the amount of dark-adaptation present 
in the subjects. The amount of dark-adaptation depended upon the total 

in darkness. The measure of amount of adap- 


length of time the subjects spent 1 
tation was the product Jt, where Z was the surface brightness of the stimulus 
plate and £ was the number of seconds spent in darkness. The 7/ products for 


the four conditions were 28,920; 187,980; 347,040; and 506,100. The corre- 
sponding values of ¢ were 120, 780, 1414, and 2100 seconds. 
TABLE XXIV 
DATA FROM THE EXPERIMENT OF Grant AND Norris (10) 


Conditions 


4 


P, 13.2 30.0 45.6 51.5 
P, 11.05 36.10 44.30 49.35 


Thirty-three subjects participated in the experiment. A subject was first 
dark-adapted for 120 seconds. The stimulus light was then presented four times, 
with a “control” trial between the first two and last two presentations. The con- 
trol trial served as a check on possible conditioning. The four stimulus trials 
were separated by dark intervals of 35 sec. on the average. The first four 
stimulus trials (coming after 120 secs. in darkness) were the trials for condition 1. 

ht was again presented four times along 


Condition 2, in which the stimulus ligh 
With a control trial, came after the subject had spent a total of 780 secs. in 


darkness. Conditions 3 and 4, with the stimuli presented in the same general 
fashion, came after 1414 and 2100 secs. in darkness. . 
the thirty-three subjects was given four 


. It should be noted that each of e $ 
Stimulus trials in each of the four dark-adapted conditions. A count was made 


29 The beta-response is one of two reflexes displayed by the eyelid when the eye is 


light-stimulated. 
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of the number of beta-responses occurring in each subject eee 
The results were then combined to provide frequencies of oc 2 ena 
response for each condition. These frequencies were used in com] , Pelis 
centages. The four observed percentages are shown in the first n E e 
XXIV. They indicate the relative frequency of occurrence of the beta-re D : 
in each of the experimental conditions. In condition 1, for example, 13. e 
the 132 possible responses of the eyelid displayed the beta-response. The othe 
three percentages may be similarly interpreted. : . ji; dii 
A logarithmic function was fitted to the data. The two variables were 74, a : 
amount of dark-adaptation, and P,, the relative frequency of occurrence of the 
beta-response. The fitted function was: P, = 13.38 log, Jt —126.39.39 Its solution 


for the empirical values of Jt yielded the values of P, given in the bottom row of 
Table XXIV. 


A value of x? was calculated as follows: 


a — (13.2 — 11.05)? | (30.0 — 36.10) (51.5 — 49.35)? 


T a 1,579. 
11.05 36.10 = 49.35 


Two constants were estimated from the data, leaving 2 df. With this number of 
df, the obtained value of x? falls near the 40% level of confidence. This led the 


: A 2 ; sented by à 
investigators to conclude that the data could be satisfactorily represented b» 
logarithmic function. 


There are four mistakes in this application of the x? test: two computational 
mistakes and two “theoretical” mistakes. 

discussed first. The calculated value of X* was not corrected to take account v 
the use of percentages instead of frequencies. It should have been multiplied by 
the ratio 132/100. The second computational mistake was a failure to take 
account of the frequency of non-occurrence of the beta-response in cach of the 
four conditions. In condition 1, for example, the beta-response occurred 13.270 
of the time and failed to occur 86.8% of the time. This] 
no part in the calculation of x?. All four percentage 
have been employed. The value of x? for T. 
puted, is 3.081. The number of df is still 2. 


: : sss of 
The other two mistakes were more basic. The use of x? to test goodness 
fit was not warranted, for two reasons. 


In the first place, there were linkage? 
within conditions. Each of the subjects was given four trials in cach of the fou" 
conditions of dark-adaptation. The results for the subjects were pooled. T hen 
were undoubtedly individual differences in capacity to display the beta-re; 
sponse; so there must have been linkages within conditions. In applying the aA 
test, the investigators assumed, in effect, that the 132 trials per condition wer 
given to 132 instead of 33 subj 


e as- 
ects. The test was inapplicable because the 2 
sumption could not justifiably be made. 


" a om 
The second reason that the x? test should not have been used arises ÍT 


a 
the lack of independence from condition to condition. As already stated, 
fundamental requirement for the use of the x? 
individual measures. 


The computational mistakes will be 


aved 
atter percentage play " 
s of non-occurrence shou 
pg ane 
able XXIV, when correctly co! 


2. % jeen 
test is independence be ai 
The differences between individual subjects that pene 
fested themselves in any condition were certain to be maintained in the © 


: : 7 are 
conditions; so it cannot be assumed that the values of P, in Table XXIV 
unrelated. 


3 With common logarithms, the function becomes: 
P, 30.81 logi, 71—126.39. 
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APPLICATIONS OF THE x? TEST TO NON-FREQUENCY Data 


Some investigators are prone to use a x? test of goodness of fit when- 
ever a set of observed and theoretical values of any kind is available for 
comparison. This mistake apparently grows from a misinterpretation 
of the well-known formula for computing x?: 


(I, cq 


F, [10] 


v= >> 


Because this equation involves differences between observed and 
theoretical values, the conclusion is reached that the summation of the 
weighted squares of differences between observed and theoretical quanti- 
tics yields a meaningful estimate of x?. Formulas superficially resem- 
bling equation [10] have been applied to non-frequency data, where 
theoretical values have been obtained from fitted curves. 

Suppose that a study has been made of the amount of activity dis- 
played by several groups of white rats deprived of food for differing 
lengths of time. Enough rats have been included in each deprivation 
group to provide means that are quite stable. These means, when plotted 
against time of food deprivation, show a systematic trend; so a curve 
is fitted to them. The equation for the curve permits the calculation of 
theoretical means which may be compared with the observed means. 
To test goodness of fit, differences between the observed and theoretical 
Zach difference is squared and then divided by the 


means are obtained. I 
The sum of these weighted squared 


appropriate theoretical mean. 
differences is taken as a meaningful estimate of x°. 

Two of the fourteen papers (4, 16) referred to in the opening para- 
graphs contain applications of this type. The fallaciousness of such 
applications becomes obvious when it is realized that values of x? com- 
puted from non-frequency data vary in magnitude with the size of 
the units employed in measurement. Assume that two investigators 


have a common aim: To determine how the height of human males 
They make measurements of the same individuals 


fit equations of the same form to their data, use 
these equations to compute theoretical values of height, and then obtain 
estimates of x? in a manner similar to the one described above. One of 
the investigators has measured height in centimeters, the other in inches. 
Except for incidental discrepancies, the value of x? calculated from the 
centimeter data will turn out to be 2.54 times the value calculated from 


the inch data. : : . : 
The x? test of linearity of regression. A x? test is recommended in certain 


textbooks (26, p. 319; 12, p- 237) as suitable for use, with non-frequency data, 


varies with age. 
In various age groups, 
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in evaluating linearity of regression. The formula, as usually presented, is as 
follows: 


2 
gut wa (19] 
le 

In this formula, 7 stands for correlation ratio; r for product-moment coefficient 
of correlation; N for the total number of measures; and £ for the number of 
columns (groups) into which the measures have been divided. The number of 
df is k—2. The formula yields a variable, the distribution of which approxi- 
mates the x? distribution, under certain conditions. It can be used with some 
degree of confidence provided that N is quite large and & is quite small, and 
provided also that the measures from column to column are independent and 
homoscedatic as well as normally distributed. 

An exact test of linearity of regression, which is applicable whenever equa- 
tion [19] is, can be made by computing an F-ratio. The formula is 


Lo = nu = 5 
(= aE = 2)" 


[20] 


The df's for this F are (E —2) and (N —k). The reason that the x? test, as de- 
fined by equation [19], can be substituted under any circumstances for the F 
test, as defined by equation [20], is that the distribution of F approximates the 
distribution of x? when one of the df's for F is very small and the other is very 
large. The particular x? distribution that is approximated is the one for the 
smaller df (that is, for df =k —2). To state the point in another way: The 
sampling distribution of estimates of F, obtained with equation [20], approxi- 
mates the sampling distribution of estimates of x? obtained with equation [19], 
provided that & is very small relative to N. 

Inasmuch as the x? test, as represented by equation [19], is inexact, while the 


F test, as represented by equation [20], is exact, nothing is gained by using the 
x? test. 


SPECIAL PROBLEMS: I. INDETERMINATE THEORETICAL FREQUENCIES 

It sometimes happens, despite superficial indic 
that meaningful theoretical frequencies cannot 
set of observed frequencies. This situation commonly arises from a lack 
of independence between measures, Two illustrations will be presented 
to reveal some of the chief sources of difficulty. 

First illustration. The first illustration is concerned with coin-guessing data- 
Two hundred and forty university students each made a guess of heads or tails 
on four successive tosses of a coin. They recorded their guesses on individual 
record sheets. They were not told in advance the number of tosses that woul 
be made, nor were they told how the coin turned up on the four tosses until the 
record sheets had been collected. The turns, in order of occurrence, wer 


ations to the contrary: 
be determined for à 


a A fuller explanation of equation [20], t : à " " ther 
F tests of goodness of fit, is [20], together with a detailed discussion of o 


given by Lewis (19). 
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H TX H. The succession of guesses of each student could easily be c 

with this succession of turns and the number of correct guesses ‘for e "dee 
be tabulated. As expected, the number of correct guesses ranged from E 
The results are summarized in the first two columns of Table XXV. As DI 


TABLE XXV 
ANALYSIS OF CoIN-GUESSING Dara 


Number i cr Capt 
me f P(n) rR ——— Ph’ R/ li 
F F/ 
0 15 .0625 15 .000 .0490 11.76 .893 
1 47 .2500 60 2.817 .1680 40.32 1.107 
2 60 .3750 90 10.000 .3129 75.10 3.036 
3 86 .2500 60 11.267 .3289 78.93 .633 
t 32 .0625 15 19.267 .1412 33.89 .105 
25 240 1.0000 240 43.351 1.0000 240.00 5.774 


15 students made no correct guesses; 47 made one correct guess; 60 made two 
correct guesses, ete. With frequency distributions of this type, it is common 
practice to apply the chi-square test after a binomial distribution function has 
been employed to calculate the required theoretical frequencies. The function 
for the present case, modeled after equation [3], would have the form 


Be Ri (z) (>) = <a n)! (D B^ 


and would yield theoretical pro 

. Values of P(n) calculated wit 
Table XXV with corresponding t 
Column. If these values of / were 


ee as illustrated in the fifth co 
hypothesis that the observed frequencies are distributed in accordance with 


equation [3’]. In effect, this would be testing the hypothesis that each guess of 
every student was a purely chance occurrence, completely independent of every 
Other guess, the probability of a guess of heads always being .50. The number 
of df is 4. If the hypothesis were true, the obtained value of x?, 43.351, would 
not be expected to arise in random sampling once in a million times. 

The use of a binomial distribution function in calculating theoretical proba- 
bilities cannot be justified in this case. There is good reason for believing that 
the guess of a student on any of the last three tosses was not independent of 
Previous guesses. The binomial distribution function is applicable only when 
there is a sound basis for assuming that every event under consideration is 
Completely independent of every other event. 

Theoretical frequencies for the empirical data in Table XXV are inde- 
terminate—except as they might be estimated from probabilities yielded by 
extraneous empirical data. When hypothetical probabilities do stem from 


babilities of obtaining » correct guesses in 4, 

h equation [3'] are given in the third column of 
heoretical frequencies appearing in the fourth 
legitimate estimates, a value of x? could be 
lumn of the table and used to test the 
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other empirical results, the hypothesis that can be tested may be quite ies sree 
from the one that would be tested if a binomial distribution function vieldec 
the theoretical values. For example, the data of Table XX\ may be compared 
with somewhat similar findings published several years ago by Goodfellow (9). 
who analyzed the patterning of the guesses on five successive tosses of a coin by 
a large number of radio listeners. These listeners participated in the “telepathic 
experiments” conducted in 1937-38 by the Zenith Foundation. The coin was 
tossed in the broadcasting studio in Chicago. A total of 5.687 members of the 
radio audience wrote down their guesses in order and mailed in their answer 
sheets. They were told in advance that they were to make five guesses, but 
were not told until two or three weeks after the broadcast how the coin had 
actually turned up each time. The conditions of guessing were not identical 
with those holding when the data in Table XXV were secured, but were similar 
enough to permit a comparison of the results. 

Goodfellow (9, Table II) tabulated the results on the radio listeners in a way 
closely approximating that used in Table VIII. In fact, the only difference of 
any consequence is that he tabulated percentages instead of frequencies. The 
"correct" pattern of H T T H was checked against the 32 patterns in Good- 
fellow's table to obtain the proportions of individuals in the total group of 
5,687 that hypothetically made 0, 1, 2, 3, and 4 correct guesses. These propor- 
tions are given in Table XXV, in the column headed F,'. The resulting value of 
X^, as shown in the last column of the table, is 5.774 with df =4. The hypothesis 
that may now be tested is that the patterning of the guesses by the 240 students 
was the same as the patterning of the first four of five guesses by the large 
group of Zenith Foundation listeners. The obtained value of x? falls near the 
20% level of confidence and provides no basis for rejecting this hypothesis. If 
the value of x? had been large enough to justify a rejection of the hypothesis, 1t 
would not be possible to decide whether the patterning tendencies were basically 
different in the two groups or whether the differences in the conditions of 
guessing produced an apparent difference in patterning. Nevertheless, the test 
of the hypothesis, as stated, is an exact one. 
indefiniteness which was present when a value of x? was based on theoretical 
frequencies obtained with a binomial distribution function. Because of the 
strong likelihood of interdependence between the guesses of individual guessers, 
the highly significant value of x? in the fifth column of Table XXV could be 
interpreted to mean that all guessers were biased, that some of the guessers were 
strongly biased while others were unbiased, that the probability of a guess of 


heads was not .50, that the probability of a guess of heads was .50 on some 
tosses but not on others, etc. The absence of independence and the consequent 
inability to obtain unequivocal theoretical (chance) frequencies made this ap- 
plication of the chi-square test a meaningless procedure. 

Second illustration. Another illustration of the indeterminateness of theo- 
retical frequencies comes from a Paper by Seward, Dill and Holland (28) 
These investigators were concer 


This is in sharp contrast to the 


tangular window in the panel. 


BÓ — 
———— See ieee 
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In the "learning" series, the colors (except blue-green) were exposed once 
each in a predetermined order and the subject pressed the buttons until one 
was found which turned on a light. Nine of the ten colors were each paired with 
one of the buttons. The blue-green color was associated with two buttons, the 
fifth and eighth. The ninth button was a blank. The blue-green color was 
presented twice in the "learning" series. When first presented, the light was 
connected with the fifth button for half of the subjects and with the eighth 
button for the other half. On the second presentation, the connections were 
reversed. Thus each subject had an opportunity to develop associative con- 
nections between blue-green and. button 5, and also between blue-green and 
button 8. 

In the “test” series, the ten colors were again presented in a predetermined 
order and each subject was given six chances to push the correct button for each 
color. In this series, blue-green was exposed last and the connection was such 
that the pressing of botton 5 or button 8 would turn on the light. The testing 
on any color was terminated as soon as the subject pushed the button that 
turned on the light or had pushed six buttons. 

The investigators applied the chi-square test to the results on blue-green in 
an attempt to determine whether or not learning had occurred. Values of F, 
and F, taken directly from their Table I (28, p. 231), are shown in the first two 
rows of Table XXVI. As seen, 24 of the 110 subjects pressed either button 5 or 
button 8 on the first trial. Twenty-five subjects pressed one of these two but- 
tons on Trial 2, 22 on Trial 3, ete. Ten of the subjects failed to find either of the 


correct buttons in six trials. 
TABLE XXVI 


DATA FROM THE LEARNING EXPERIMENT OF SEWARD, DILL AND HOLLAND (28) 


Trials Required to Find Correct Button 


- Total 
1 2 3 4 5 6 >6 
Py 24 25 22 21 4 4 10 110 
F, 18.3 16.7 15.0 13.3 11.7 10,0 250 110 
b 2/12 2/11 2/10 2/9 2/8 2/1 ? 
n 110 91.7 75.0 60.0 46.7 35.0 25.0 


The values of F; in the second row of Table XXVI were obtained by assum- 
ing that the probabilities of success on the six trials were those given in the 
third row. In other words, P was assumed to be 2/12 on trial 1, 2/11 on trial 2, 


2/10 on trial 3, etc. A value of x? was computed as follows: 
- — 25.0)? 
= gi- 18.3)? (25 16.7) ++ ao 25.0)? = 31.3. 
18.3 16.7 .0 


The only restriction imposed during the calculations was that R= DF; 
=110. This left 6 degrees of freedom. With this number of df, the computed 
value of x? is highly significant. It led the investigators to reject the hypothesis 

and to conclude that the subjects 


that the button pushing occurred by chance n 
ad, in fact, learned to associate blue-green with button 5 and/or button 8. 
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This application of the chi-square test was unwarranted because — 
cal theoretical frequencies for all trials, except possibly for trial 1, were in ! 
terminate. The reason is that the events on trials 2 to 6 were related to prev ies 
events. The investigators seemed to be partially aware of this fact, as is show B 
by their choice of values of p. For example, consider the probability of succ ces 
on trial 4; it was put at 2/9. This probability arose from the assumption that 
each subject remembered and avoided the three buttons that had already been 
pushed, completely forgetting, meanwhile, all choices that had previously been 
made when other colors were presented. Just prior to the blue-green test series, 
the subjects had each pushed up to six buttons in relation to each of the n 
nine colors. It is inconceivable that the subjects could have remembere 
perfectly the buttons that had been "used up" either during the blue-green 
series or before on other series, unless they adopted “systems” of pressing—a 
system, for example, of pressing from either end toward the center, or from the 
center toward either end, or from the third button toward the right, or from the 
tenth button toward the left, etc. The adoption of any system or systems of 
pressing by any subject would mean that any hypothesis concerning chance 
proportions was open to rejection prior to the calculation of a value of x 

When a decision concerning the legitimacy of theoretical probabilities is 
difficult to reach, it is sometimes helpful to see what happens if different proba- 
bilities are secured on the basis of new and not-unreasonable assumptions. It 
could be assumed, for example, that the 110 subjects used by Seward, Dill and 
Holland had, during the learning and early test series, discovered (on the 
average) that three of the twelve buttons (the two at the ends perhaps, and one 
in the middle) ‘‘belonged"” to three colors other than blue-green, and thus by 
inference did not belong to blue-green. When blue-green was finally exposed, 


TABLE XXVII 


ALTERNATIVE ANALYSIS OF THE DATA or SEWARD, DILL AND HorrAxD (28) 
SS 
a a FN 
Trials Taken to Find Correct Button 


Total 
1 2 3 4 5 6 6 
F, 24 25 22 21 4 4 10 110 
F, 24.4 21.4 18.3 15.3 12.2 9.2 9.2 110 
P 2/9 2/8 2/1 2/6 2/5 2/4 ? 
n 110 85.6 — 64.2 45.9 30.6 184 


three of the buttons were already '*used up" 
The theoretical frequencies might be the one 
XXVII. A value of x? for this table is calcu 


a (24 — 24.4) (25 — 21.4)? 
24.4 ag teh 
this value falls slightly below the 5% 


sumptions and, regardless of its confid 
ngful conclusions; but the fact that i 


and were not likely to be pressed. 
s given in the second row of Table 
lated as follows: 


(10 — 9.2) 
9 


= 12.005. 
With 6 df, 
dubious as 
for meani 


level of confidence. It rests E 
ence level, provides no groun 5 
t differs so greatly from the X 
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value of 31.3 computed for Table XXVI strongly suggests that unequivo 1 
theoretical frequencies are indeterminate in both cases. " wis 


It is not possible to provide rules-of-thumb for deciding whether 
theoretical frequencies are calculable or incalculable in particular situa- 
tions. Decisions must ordinarily be based on careful logical analysis 
However, it is usually true that theoretical frequencies are incalculable if 
the observed frequencies are in any way related, and also if mutually con- 
tradictory assumptions can be made, with about equal justification, con- 
cerning the likelihood of occurrence or non-occurrence of the events (re- 
sponses) that yielded the observed frequencies. 


SPECIAL PROBLEMS: II. THE NATURE OF IMPOSED RESTRICTIONS 


All restrictions that are imposed during the determination of theoret- 
ical frequencies should be both linear and homogeneous. This limitation 
is seldom mentioned in either theoretical or practical treatments of the 
chi-square test, and even when mentioned is usually left unexplained. 
The main reason for the omission is that anything beyond a very super- 
ficial explanation cannot be given in other than mathematical terms. It 
follows that only a few very general ideas can profitably be included 


here. 
It is probably quite obvious to most readers why restrictions must 


be imposed—why the sums of the observed and theoretical frequencies, 
for example, must always be equalized. The value of EF, is fixed for 
any set of empirical data. The value of ZF: cannot "wander around 
any place" without at times yielding entirely impossible cell frequencies. 
Any hypothesis must be tied down at some point to the sample data. 
There should be as much freedom as possible (for example, as much 
freedom as possible for fluctuations in the individual cell frequencies) 
and yet there must be enough restrictions to bring the over-all values 


within the same general area. . 
The one restriction that must always hold may be symbolized as 


follows: 


XR-XF. [21] 


or 
YE — Fi) = 0. [21a] 


2 The excellent attempt at elementary explanation offered by Greenhood (11, Chap. 
3) is about as non-mathematical as it could conceivably be made, and yet requires a 


Considerable amount of mathematical sophistication. 
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This restriction is clearly linear. " 

Some of the other restrictions that must be imposed in familiar 
applications of the chi-square test may be shown to be lincar—and to 
be homogencous also. Suppose, for example, that a normal curve is 
to be fitted to an array of observed frequencies. It is not enough to 
impose the single restriction specified by equation [21]. The reason is 
that a multitude of different combinations of values of F, all arising 
from normal distributions, will each, when summated, equal EF, A 
second restriction must obviously be placed. It may be written 


DPX = FX, [22] 


or 


25 X(F, — Fi) = 0, 22a] 


where X is the measure associated with the cell. Stated in more familiar 
verbal terms, the restriction is that the means of the observed and 
hypothetical arrays of measures shall be equal. 

There is still too much freedom for values of Fa if à normal curve 


is being fitted. The "scatter" of the hypothetical measures must also 
be restricted. In symbols, 


L FX = px, [23] 


or 


25 XF, — F) = 0. [23a] 


These equations, in effect, state that the variances of the observed and 
hypothetical arrays shall be the same. 

The three restrictions represented by equations [21], [22], and [23] 
are imposed when a normal curve is fitted to a set of observed fre- 
quencies.** The equations are all linear and homogeneous in (F,— F2: 
This is best seen from a comparison of equations [21a], [22a], and [23a]. 

Examples of non-linear restrictions. 
posed in applications of the chi-squ 


However, there are a few situ 
One of these has alre 


As a general rule, the restrictions im- 
are test meet the linearity requirement. 
ations where non-linear restrictions are made. 
ady been mentioned in the discussion of the phi-gamma 
function. In the phi-gamma case, theoretical frequencies are commonly 0b- 
tained through a process that involves the minimizing of the sum of the squares 
of the differences between empirical and theoretical values of y, where y and 
are non-linearly related. This non-linear relationship is clearly indicated by 

? One exception to this generalization should be mentioned. If values of the hypo" 
thetical mean and hypothetical variance are not estimated from the empirical data but 
come from some extraneous source, the only restriction is that LA=} Fe 
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equation [15]. Despite the lack of linearity, the chi-square test is often applied. 
It must be realized that this test is not a rigid one; any estimate of x? obtained 
from the differences between observed and theoretical values of p is not neces- 
sarily distributed as x*. Consequently, whenever x? is used in evaluating the 
goodness of fit of the phi-gamma function, conclusions regarding the fit must be 
made with caution. 

Another situation where a non-linear restriction is imposed in calculating 
theoretical proportions is in the use of Thurstone's Case V in treating data ob- 
tained by the method of paired comparisons (13, Chap. VII). The observed 
proportions are based on comparative judgments, the various stimuli being 
compared with each other. The proportions are translated into normal devi- 
ates (that is, into z-scores). Several z-scores typically enter into the calculation 
of each scale separation (distance). A scale separation based on several z-scores 
may be reconverted into a single "theoretical" z-score, the s-score then being 
used to obtain a theoretical proportion. A chi-square value computed from the 
differences between observed and theoretical proportions would not necessarily 
be distributed as x?. Ordinarily, of course, the chi-square test is not applicable 
to paired comparisons data because of a lack of independence between the ob- 
served proportions. . f 

Non-linear restrictions are always imposed in obtaining theoretical fre- 
quencies or proportions if the reduction (transformation) process is used in 
curve fitting. (Almost any method of estimating parameters for complex func- 
tions may involve non-linear restrictions.) To illustrate: Grant and Norris (10) 
fitted a logarithmic function to the percentages in Table XXIV. In doing this, 
they probably transformed values of the independent variable (that is, values of 
It) into their logarithmic equivalents and used these in estimating the parame- 
ters for the logarithmic function. If they followed such a procedure, the calcu- 
lated theoretical percentages were dependent upon a non-linear restriction. 

Unfortunately, there is no way of determining the exact influence of non- 
linear restrictions on estimated values of x*. Therefore, if the chi square test is 
applied, despite the imposition of non-linear restrictions, the iny € must 
be extremely cautious in interpreting the results,?' bearing in m t e as 
Cramér shows (5, Ch. 30), the calculated value of x? is probably somewhat 


larger than it would be if the restrictions were linear. 


CONCLUSIONS 


now have correctly concluded that the chi- 
However, it usually cannot be 


applicable and it thus stands as 


Most readers will by 
Square test has a restricted usefulness. 


replaced in those situations where it is i i 
à valuable research tool. Perhaps the chief trouble is that the test is 


too often applied without adequate prior planning; it is frequently hit 
upon" and adopted after data have been collected and sometimes after 
Other techniques of statistical analysis have been found unproductive. 
hen the F statistic, based as it is on two independent 


dness of fit. For example, several of the F-tests 
tness on the type of restrictions placed 


*! These statements hold also w 
estimates of x?, is applied in testing goodne 
Proposed by Lindquist (22) depend for their exac 
in estimating the parameters of the fitted functions. 
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The aim of every investigator should be to plan, in advance, not only 
every detail of every experiment but every step in the analysis of the 
anticipated data. All contingencies cannot be foreseen; but if the chi- 
square test is to be employed, there is no good reason for failing to 
provide for independence among the measures and for frequencies of 
adequate size. 

There should seldom if ever be any compromising on the requirement 
of independence. There should usually be no compromising on the 
size of frequencies. There are occasions, of course, when it is very time- 
consuming and perhaps very expensive to add more cases to a mere 
handful. The best procedure under such circumstances is to try for 
an experimental design which utilizes each subject to the limit and leads 
to an analysis of data on an individual rather than a group basis. If 
it turns out that only a few subjects can be studied and the data on cach 
one cannot be analvzed separately, it may still be possible to find a 
method of analysis which is more exact than the chi-square test. For 
example, if the data can be arranged into a 2X2 table and the individual 
cell frequencies are less than 10, the exact treatment proposed by Fisher 
(7, pp. 96-97) is to be preferred to the chi-square test. The treatment is 
rather tedious to apply, but in view of its exactness there is no adequate 
excuse for avoiding it. 

Many users and would-be users of the chi-square test gain erroncous 
impressions from what they read about limitations on the size of theoret- 
ical frequencies. A textbook says that frequencies of less than 10 are 
to be avoided. This statement is often interpreted to mean not that 10 
is a limiting value to be exceeded whenever possible, but that 10 is 4 
value around which the various theoretical frequencies may fall; and 
if an occasional frequency happens to be as low as 4 or 5, that is all 
right because other frequencies will be larger than 10 
will average out in the end. A textbook that gives 5 as the suggeste 
minimum tends to encourage the retention of impossibly small theoret- 
ical frequencies. And so does a text which states, in effect, that Yates' 
correction for continuity should be applied if the cell frequencies are 
5 or less and precision is desired. This implies not only that frequencies 
of less than 5 are quite acceptable, but also that Yates’ correction is an 
antidote for small frequencies. Both implications are fallacious. 


The following excerpts from Yule and Kendall (35, p. 422) may help 


and everything 
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to dispel false notions concerning the size of theoretical frequencies and 
also concerning the size of N: 

In the first place, N must be reasonably large. ... It is difficult to say 
exactly what constitutes largeness, but as an arbitrary figure we may say that V 
should be at least 50, however few the number of cells. 

No theoretical cell frequency should be small. Here again it is hard to say 
what constitutes smallness, but 5 should be regarded as the very minimum, 


and 10 is better. 


Moel (14, p. 191), while giving 5 instead of 10 as the recommended 
minimal value of F; nevertheless emphasizes the importance of having 
a fairly large value of N by stating that if the number of cells or cate- 
gories is less than 5, the individual theoretical frequencies should be 
larger than 5. Cramér (5, p. 420 f.) firmly recommends a minimal value 
of 10 and says that if the number of observations is so few that the 
theoretical frequencies, even after grouping, are not greater than 10, 
the chi-square test should not be applied. In all but one of the illustra- 
tions used by Cramér, the theoretical frequencies are considerably 
larger than 10, and in the exceptional case, he admits (p. 440) that the 
frequencies are smaller "than is usually advisable." An investigator 
handicaps himself whenever he applies the chi-square test in relation to 
small theoretical frequencies. 

There are a few applications of the x? test which have not been de- 
scribed and illustrated in the present survey, either because they are 
quite specialized in character or because they provide only approximate 
solutions. One of the specialized applications which may be of interest 
to some readers is Bartlett's test of the homogeneity of variance (3). 
Those who do not have access to Bartlett's original discussion will find 
à description of the test in Snedecor (31, pp. 249-251). Another 
specialized application of interest is the use of x? in setting the confidence 
limits for a population variance from a known sample variance. The 
procedure is explained by Hoel (14, pp. 138-140) and need not be 


included here. i i 
In general, any suggested applications of x° which deviate front the 


well-established tests should be avoided except by those qualified to 
evaluate their full import or upon the advice of an expert. 
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TASTE BLINDNESS TO PHENYL-THIO-CARBAMIDE 
AND RELATED COMPOUNDS 


JOZEF COHEN anp DONALD P. OGDON 
University of Illinois 


I. THe PROBLEM 


In 1931, Dr. A. L. Fox of the E. I. du Pont de Nemours Company 
was preparing a quantity of phenyl-thio-carbamide (hereafter referred to 
as P.T.C.) when a gust of wind blew some of the compound about. 
Dr. C. R. Noller, working in the same laboratory, complained of the 
bitter taste. Fox believed Noller to be mistaken, and put some of the 
crystals on his tongue. The taste sensation was neutral. Noller tasted 
some of the crystals and emphatically insisted that the compound was 
extremely bitter. 

This incident touched off a series of experiments concerning the 
relationship of this and similar compounds to individual gustatory 
discrimination. Persons failing to taste the substance were termed 
"taste blind," following an analogous anomaly in color vision. 

Fox (43, 44) reported that taste blind observers constitute about 
40% of the Caucasian American population. This figure was confirmed 
in approximation by Blakeslee and Salmon (7), Snyder (37, 38), Fern- 
berger (15), Blakeslee and Fox (6), and Parr (28). (See Table I.) 

As it happened, the fact that some substances may taste differently 
to different people was not discovered by Fox, but by Laselle and 
Williams (22, 42) in 1926. These last investigators noted that creatine, 
a muscle constituent, was as tasteless as chalk to some people, and very 
bitter to others. They noted that one pound of lean meat may contain 
close to two grams of creatine, and it is probable that this would cause 
meats to have a distinctive flavor to the taster. This taste difference 
would be especially pronounced in soups made from lean meats which 
must contain considerable amounts of creatine. 


i The structural formula of P.T.C., sometimes called phenyl-thio-urea, 
is given by 
d —NH—C—NH, 
YV 
Para-ethoxy-phenyl-thio-carbamide i 
to P.T.C., and bears the same taste 
formula is 


S a substance closely related 
characteristics. Its structural 


om-o- (y an Hs 
EN 
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In the above, if the single sulphur atom is replaced by an atom of 
oxygen, a commercial sweetening agent called "Dulcin" is obtain d 
t E ained, 


whose structural formula is 


@ N—NH—C—NH; 
C4H,—O—[. | i 
EN O 


Dulcin is used interchangeably with sugar in the manufacture of candy 
with the difference that 250 times as much sugar as Dulcin is ud 
to achieve an equivalent sweetening effect. The similarity in chemical 
and Dulein was reported by Fox on several 


structure between P. T.C. 
Occasions (43, 44, 16, 17). 


. Indeed, it was this close rel 
E the entire family of compounds to determine the chemical 
ese of taste. Elaborate experimental evidence shows that it is the 
CS linkage which determines the taste of the substance. Fox (16) 
r many related mono-aryl-thio-carbamides 
Of these compounds, only two 


ationship which encouraged Fox (17) to 


ports taste differences fo 
nt symmetrical di-aryl-carbamides. 
Vaid e found which deviate from the class to which they belong, 
Mur Sada ee this deviation to insolubility. Fox (43) and Riddell 
grew * E (29) report similar taste differences for the entire thiourea 
jo. ns extensive work with di-phenyl-guanidine is reported by 

: and Davidson (40). It is interesting to note that tetra-methyl- 
a sour taste to both tasters and non- 


thio-e; roe 
carbamide is reported to have 
If is reputedly nauseating, neither 


tint while thio-carbamide itse 
nor sour to both tasters and non-t 
wn em and Fox (6) and Blakeslee 
as » ew individuals for whom PIG. t 
MM tors report a small portion of the popu 
eae ai salty, camphory, sulphury, etc. 
ce (14) are an error of experimental procedure wi : 
iem and Blakeslee and Fox (6) who insisted that these co 
rately be described or classified as odors. 


asters. 

(5) report that there are a 
astes sweet. Almost all in- 
lation for whom P.T.C. 
That these and similar 
as suggested by Blakes- 
uld more 


CAL FACTORS 


II. PrivsiOLOGI 
) have reported th 


at P.T.C. 


Thres 
has reshold Data. Blakeslee and Salmon (8 
been "€ Widest range of thresholds of any substance studied. It has 
s acial groups) that the distribution 


n 
SE uy established (for several r 
“Ported | thresholds forms a bi-moc 
"Ox (6) d Blakeslee and Salmon 
» Parr (28), Rikimaru (32, 33, 34 


lal distribution. This evidence is 


(7), Blakeslee (3), Blakeslee and 
Ji Setterfield, Schott and Snyder 
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(36), and Matson (26). Asan illustration, we have plotted the data of 
Setterfield, Schott, and Snyder (36) in Fig. 1. 

The individual threshold variability to P. T.C. is very large. Blakeslee 
and Salmon (8) disclose experiments with ten subjects who were exam- 
ined four times a day during from four to nine days. Thresholds for a 
given individual were found to vary by a factor of from 2 to 256. Further 
experiments testing seven individuals at fifteen-minute intervals indi- 


a 
eo 


Frequency 
w 


20 


: 7 — 1172! 
1 a .01 .001 .0001 -00 
Fraction of 1 Per Cent P.T.C. Solution --log 


Fic. 1. Bimodal distribution of taste thresholds among tasters to P.T.C. The pop" 
lation is Caucasian. The number of individuals with a threshold of less than .00001 ° 


1% P.T.C. solution is 13. These data are plotted from tables given by Setterfield, Schott 
and Snyder (36). 


cated that changes occur in this short time—and even in a time as short 
as five minutes. No regularity was found in either the direction OF the 
amount of change, nor was there any rhythm in the changes. 

Blakeslee (3) and Cook (14) note that threshold measurements for 
P.T.C. do not correlate with threshold measurements for quinine 
picric acid, hydrochloric acid, saccharine, or salt. f 

Temperature in itself has no effect on the taste characteristics ° 
P.T.C. When the solution is cold, however, less P.T.C. can be dissolV^ 
in it, and Blakeslee and Salmon (8) report a delayed reaction to yo 
solutions. Analogously, Blakeslee (3) reports that many non-taSt^. 
may taste a hot solution, since more P.T.C. may be dissolved at ad 
temperatures. 


^ na . ers 
Saliva Condition. The saliva condition of the tasters and non-tast 


i rc ee n» ^ rn 
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has been the subject of much experimentation. Blakeslee (2), Fox (17), 
Mee (27), Blakeslee and Salmon (8) have all suggested that it may be 
the saliva which is the determiner of the taste or non-taste and not the 
"taste apparatus” itself. Fox (17) specifically suggested the possibility 
that non-tasters have in their saliva a product, possibly a protein or a 
colloid, which precipitates this compound as a very insoluble substance 
which causes no taste sensation. 

Snyder (37, 38) and Blakeslee (3) report that there is no correlation 
between the rhythmic change in the pH of the saliva and taste acuity. 
Blakeslee and Salmon (8) dried the tongues of each of their subjects 
with blotting paper and then rinsed these tongues with water. Four 
of six subjects were rendered less sensitive after this treatment, and the 
degree of desensitization varied considerably among the four testees. 

The problem of the influence of saliva was analyzed by Cohen and 
Ogdon (13) in 1949. They reasoned that this question might be settled 
by allowing tasters and non-tasters to taste using another taster's or 
non-taster’s saliva. Their results indicate that an individual will have 
the taste sensation when the following two necessary conditions are met: 
(1) he must have the correct "taste apparatus" and (2) he must have 
his own saliva. A non-taster cannot taste under any conditions, even 
when he uses the saliva of a taster. A taster cannot taste under any 
circumstances, except when he uses his own saliva; he cannot taste if 
he uses the saliva of another taster or non-taster. He can taste if he uses 
his own saliva, even though the saliva is placed on his tongue in exactly 
the same manner as the saliva from another individual. No observer 
can taste P.T.C. when the crystals are dissolved in water and no saliva 
1S used at all. : 

By way of theoretical explanation, salivas are probably as different 
as fingerprints. Blakeslee and Salmon indicate that the saliva is im- 
Portant and Cohen and Ogdon note the fact that saliva is necessary to 
taste P.T.C. The observation that it must be the individual s i saliva 
may be brought about by the fact that the “taste apparatus — 
Over the years, extremely sensitive and specialized to the particular 
Saliva which the individual possesses. This being the case, when other 
Saliva is introduced, it is equivalent to water, and no a 
Tesults, Additional experiments by Cohen and Ogdon indicate that 
Saliva may aid in the taste of other substances, but this effect is most 


Pronounced with P.T.C. 

_ Sex. The sex factor in the ability t 
With conflicting results. Originally, 
“Nciency was not related to sex, an 


o taste P. T.C. has been investigated 
Fox (43, 44) reported that taste 
d this was confirmed by Snyder 
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(37, 38), Rikimaru (32, 33, 34), Setterfield, Schott and Sever P 
and Snyder and Davidson (40). On the other hand, Fernberger (15) anc 
Bovd and Boyd (11, 12) report the taste ability between the sexes to 
be ‘significant. These last investigators published the ratio of female 
tasters to male tasters in Wales to be 1.66. The Fernberger ratio 15 
only 1.11. Slight sex differences are noted in most other papers, but 
they are statistically insignificant. : 

Age. The possibility of age influencing the ability to taste p.T.C. has 
been investigated by Blakeslee and Salmon (7), Rikimaru (32, 33, 34), 
and Snyder (37, 38). On the whole, the age factor may be considered to 
be unrelated to taste ability. The apparent slight increase in acuity at 
certain ages, reported by Rikimaru, is probably insignificant. 

Smoking. Hall and Blakeslee (20) report extensive experiments con- 
cerning the effect of smoking on ability to taste P. T.C. These experi- 
menters set down the following conclusions: (1) Smoking has a definite 
effect upon taste acuity. (2) The time which must elapse after smoking 
before the threshold returns to a resting position varies with the indi- 
vidual. In these experiments only 58% returned to normal within one 
hour. (3) The incipient effect of smoking is the direct dulling of the 
"taste apparatus." (4) The true effect of nicotine on the nerves associ- 
ated with taste appears to be the same as for other nerves— initial 
stimulation followed by depression. 


III. GENETIC AND ENVIRONMENTAL FACTORS 


The ability to taste P.T.C. is inherited as a simple unit-factor gone 
nant in accordance with the primary Mendelian principle. This 1$ m 
extremely important characteristic of P. T.C. since it affords the genet! 
cist a simple test of the Mendelian laws. 

Family Studies. In 1931 Blakeslee and Salmon (8) and Snyder (35) 
independently discovered the simple recessive character of taste plind- 
ness. They classified the parents of families into three groups according 
to their phenotypes—both parents tasters, one parent taster, neither 
parent taster. The taster vs. non-taster characteristics of the offspring 
were measured, and the Mendelian nature of the transmission infer. 

In the following year, Snyder (38) conducted a monumental stu” 


i x : P 3 . eimple 
tory of the Ohio State University. His theory concerning the $!” ie 
Mendelian inheritance was confirmed. Other confirmation using fa™ 


i 
has come from Rikimaru (32, 33, 34, 35), Strandskov (41) and Lev 
and Soboleva (25). 


: m : 244 bY 
Twin Studies. Twenty pairs of twins, classified as identical ^ 
relatives, were studied by Rife (30). Each pair of twins had the 
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taste characteristics with the exception of one pair of apparently mono- 
zygotic twins. But in this case the non-taster twin had suffered a general 
taste deficiency, which may have been the result of a severe case of 
poliomyelitis. Ardashnikov, Lichtenstein, Martynova, Soboleva and 
Postnikova (1) investigated 137 pairs of identical twins and found only 
three. pairs who differed in their reactions to P.T.C. The non-taster 
sibling of one of these pairs was pathologically handicapped. However, 
no physiological or psychological anomaly was reported for the other 
two pairs who differed in reaction. 

A somewhat more elaborate twin study has been conducted by 
Botsztejn (10) wherein he divided his population into three groups— 
tasters, taste weak, and taste blind. Using groups of identical and frater- 
nal twins, he concluded that hereditary transmission followed a multiple 
allele series. Taster was dominant to taste weak, and taste weak was 
dominant to taste blind. 

Transmission of Taste to Related Compounds. Snyder and Davidson 
(40) report that the inheritance of taste to di-phenyl-guanidine follows 


the same laws as P.T.C. Riddell and Wybar (29) conclude that taste to 


thiouracil is inherited in the same manner. 

Linkage in Transmission. All investigators report that taste ability 
to P/T.C., is neither sex-linked nor sex influenced. Snyder, Baxter, and 
Knisley (39) have found that reaction to P.T.C., blood groups, and blood 
types are all inherited independently, nor is it related to brachydactyly 
(21). 

Environmental Factors. 
P.T.C. may be modified by environ 


been secured to confirm this. 


Blakeslee (3) has suggested that taste to 
nental factors, but no evidence has 


IV. ETHNOLOGICAL FACTORS 


The simple inheritance characteristics of taste to P. T.C. have en- 


couraged anthropologists to attempt the tracing of ethnological groups 
; populations. It was hoped that the 


by tł 5 i 
/ the per cent taster in various 
the same manner as blood 


numbers would form rough racial clusters 1n 


Broups 
vestigations by interested scientists 


is given in Table 1. Unfortuntely, no ordering seems to result. Indeed 
the data themselves are internally inconsistent. Witness the range of 
Merican Üauendian from 60% to 82%. This indicates that tiie Experi- 
mental procedures are weak, and some standard method of administra- 
tion of P.T.C. should be agreed upon. Almost all of the Don tebe 
Merely describe populations and give results; there is little discussion 


eXperimental technique. 


A Summary of the extensive In 
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TABLE I 
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Per Cent or TasrEns to P. T. C. iN Various POPULATIONS 


Group pode Investigator 
Caucasians (American) 60 Fox (43, 44) 
Arabs (Syria) 63 400 Hudson and Peter (28) 
Armenians (Syria) 68 294 Berberian (28) 
Northern Jews (Palestine) 68 245 Yunovitch (28) 
Semenites (Palestine) 68 59 Yunovitch (28) 
Caucasians (American) 68 283 Blakesleee and Salmon (7) 
Caucasians (American) 69 439 Parr (28) 
Caucasians (American) 69 440 Snyder (37) 
Caucasians (American) 70 3,643 Snyder (38) 
Southern Jews (Palestine) 72 175 Yunovitch (28) 
Caucasians (American) 73 1,025 Fernberger (15) 
Egyptians 76 208 Hickman & Marcos (28) 
Negroes (Alabama) n 533 Howard & Campbell (28) 
Caucasians (American) 80 232 Rife & Schonfeld (31) 
Caucasians (American) 82 477 Setterfield, Schott, & Snyder (36) 
Caucasians (Jews) 85 82 Rife & Schonfeld (31) 
Formosans (Chinese origin) 89 5,933 Rikimaru (32, 33, 34) 
U. S. Indians (mixed blood) 90 110 Levine and Anderson (24) 
Negroes (American) 90 3,156 Lee (23) 
Negroes (Kenya) 91 110 Lee (23) 
Japanese 91 921 Fukoka (18) 
Negroes (American) 92 107 Setterfield, Schott, & Snyder (36) 
Japanese 93 8,824 Rikimaru (32, 33, 34) 
Chinese 94 167 Chen and Chain (28) 
U. S. Indians (full blood) 94 183 Levine & Anderson (24) 
Formosans (Aborigines) 95 1,756 Rikimaru (32, 33, 34) 
Egyptian Sudan natives 96 805 Lee (23) 
Koreans 97 55 Fukuoka (18) 


V. SUMMARY 


——— 


This paper has discussed the taste blindness of certain individuals 
to phenyl-thio-carbamide and related compounds. Physiological, genet! 
environmental, and ethnological factors have been considered. 


LEVA, G. V., & Postnixova, E. N. 
The diagnosis of zygosity in twins. 
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ARMY AIR FORCES AVIATION PSYCHOLOGY 
PROGRAM RESEARCH REPORTS! 


SPECIAL REVIEWS 
FLANAGAN; Joun C. The aviation psychology research program in the 
Army Air Forces. AAF Aviation Psychology Program Research Re- 
ports, Report No. 1. Washington, D. C.: U. S. Government Print- 
ing Office, 1948. Pp. xii+316. $2.00. 


: On July 15, 1941, John C. Flanagan was commissioned as a major 
in the Army's Officers’ Reserve Corps and was at once assigned to duty 
in the Medical Division, Office, Chief of Air Corps. This was the begin- 
ning of the extensive research enterprise which was to become the Avia- 
tion Psychology Program of the Army Air Forces. Four years later the 
program ineluded some 200 officers, 750 enlisted men, and 500 civilians. 
At that time, Colonel Flanagan could look back with pardonable pride 
Upon a distinguished record of leadership in research planning, organiza- 
ton, and application. The comprehensive account of this work in the 
nineteen volumes of research reports under review in this issue of the 
Psycnorocicar BULLETIN symbolizes the effective manner in which the 
Program developed under the direction of Colonel Flanagan and his 
many able associates. In historical perspective, this collective research 
effort will stand out as an important landmark in the evolution of ap- 
ry substantial contribution towards the 
1 World War II. 

agan presents a general account 
s: (1)a record of the expanding 


plied psychology, as well as a ve 
effectiveness of the Army Air Forces it 

In the first volume of the series, Flan 
of the entire program, in three main part 1 
Scope of the program, in terms of problems studied and of official direc- 


tives on organization and functions; (2) a summary of the principal 
research findings and their applications to AAF problems; (3) an ap- 
Praisal of the ‘general contributions to theory, knowledge, and tech- 
Niques of aviation psychology. The second section 1s a condensation of 
the results reported in the other ¢ are reviewed 


ighteen volumes, which 

l aos acd 

ndividually in subsequent pages of this ae 

siy Phe official account of the expanston o, t : 

Sts In the beginning, research was restricte 
Nits scope was broadened to inc 


program is most impres- 
o the selection of pilots. 


lude bombardiers and navigators. 


is series were issued over a period of more 
1 of 1919. In spite of the long 


rtly to printing difficulties, it was decided to 


es in th 


lm. 
Editor's Note, The nineteen volum à 
ng in the spring 


tha 
an tu ; 
two years, the last report appear! 


delay ; 

ayi 3 

Dos, n the completion of the series, due par $ eA available. The 
Pone the publication of any of the reviews unt! 


adya k ntire program 

antages of si nated i 
Seeme; aw of simultaneous and somewhat coordi n tineresearch 
“ to overweigh the possible decline of interest 1 


des c 
ements with the passage of time. b 


1 all of ther 
evaluation of the e 


these notable war- 
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Eventually gunners were added, and so the selection and classification 
of all air-crew personnel became the responsibility of the psychological 
research units. Attention then turned to the complex field of training, 
in which some of the most outstanding work in the program was done. 
In late 1943 interest in operational problems was developing, since the 
ultimate validation of selection, classification, and training procedures 
could only be found in combat performance. In May 1944 an evaluation 
and research detachment was flown to the European theatre to study 
combat operations. Permanent rescarch units for combat commands 
were later authorized, but the war ended before they could go overseas. 
Finally, at the end of the war, the conception of an integrated research 
organization had crystallized which would study selection and classifica- 
tion, training, utilization of personnel, and the human factor in the 
design and operation of equipment. Although this plan, adopted in 
1946, was to be largely supplanted two years later by a more compre- 
hensive organization, these earlier recommendations of AAF psycholo- 
gists laid the foundations for an integrated program of research and de- 
velopment in the Air Force. . 

In the third part of the book, Flanagan summarizes the general scien- 
tific and technological achievements of the AAF psychological research 
program. Like all other wartime research efforts, the gains were far 
greater in the technological than in the scientific dimension. One looks 
in vain for really significant scientific progress in any of the areas dis- 
cussed; immediate military necessity would not permit long-range, basic 
research. 

The classification research made possible extensive studies of the 
interrelations of traits, which seemed to support an atomistic theory O 
psychological organization. The factor analyses and the low correla- 
tions among apparently quite similar activities both indicated that "the 
number of basic uncorrelated aptitude factors . . . (is) probably much 
larger than most experts had supposed." In gunnery training, for ex- 
ample, men who did well on one type of trainer might do very poorly 
on another which seemed quite similar. Because of such results, gen- 
eralization and transfer of training find little place in Flanagan's p8Y^ 
chological scheme of things. (Except in the emphasis upon correlation? 
between test scores and a variety of fairly dissimilar criteria!) 

In the field of training the most outstanding contribution was the 
determined effort to secure adequate criteria of proficiency against 
which both classification and educational procedures could be checked. 
The nature and the determinants of effective behavior on the job Pe 
came the central concern of the program in its later stages. These aC 
tivities were for the most part models of skillful and persistent attac 
upon these most difficult of all problems in applied psychology- 
though definitive solutions to these problems were not as a rule forth- 
coming, the methodological gains were very substantial. Future worker? 


LAN 
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in this field will know better how to approach such tasks, as well as 
what hypotheses and procedures to reject. 
Lye H. Lanier. 


New York University. 


DuBois, Putte H. (Ep.) The classification program. AAF Aviation 
Psychology Program Research Reports, Report No. 2. Washington, 
D. C.: Government Printing Office, 1947. Pp. xiv-+394. $1.50. 


DuBois’ report is an important historical document. It is important 
only in part because much money was spent in the aviation psychology 
classification program—$1,309,821 is the estimated expenditure for 
1943 alone (p. 305)—some of which may be recovered from this book 
and its supporting files in a future war emergency. It is important be- 
cause some of the mistakes made in experimental design can, if viewed 
critically and constructively, point the way to more adequate research 
development of differential selection tools. (The book being reviewed 
does not, however, look critically upon the experimental designs em- 
ployed in the studies.) Furthermore, the report is important for the 
implications it has for organization and implementation of needed per- 
sonnel research programs of large scope in the area of industrial as well 
as military personnel. 

There are six chapters and a brief n 1 j l 
ter deals with planning for selection of trainees and differential classifi- 
cation of pilots, bombardiers and navigators. The second chapter dis- 
cusses the AAF psychological organizations concerned with selection 
and classification of air crew. The organization of the various AAF 
research units and the ten medical and psychological examining units 


are treated in considerable detail. ] 
In Ch. 3 the classification batteries are discussed. The us e] 

to be used in the examining units of the Air Surgeon was = i 
early in 1942, Five changes were made in it during that uio oe z 
changes were made in the next three years. During most al t «d EG 
the Air-Crew Classification Loteries ana approxima ely 
objective-ty i ts and six apparatus tests. — 

"or peni cus sce = many personnel technicians is the manner of 


Weighting the many tests comprising the classification battery: 


i ight- 
The classi i ry of February, 1942, employed an arbitrary weig 

ing oe ee for an immediate wr gr 
a priori system utilizing the tests that were ayallab S. m Wi mene 
to the classification battery of June, 1942: On the. water ecl serica 
analyses of the requirements of air crew, the wins pom um Ba 
revised” (p. 86). For the classification battery of August, ^ A ones 
time, the classification battery was used for selection inste 


summary chapter. The first chap- 


ibed i jew: d 5. 
! Editor's. Note. These tests are described in the reviews of Reports 4 an 
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classification. . .. The weights were readjusted on the basis of professional 
judgment, since validity data were still incomplete and the test in- 
tercorrelations were not available" (p. 90, italics the reviewer's). But in the 
classification battery of July, 1943: “for the first time test weights for all three 
air crew aptitude scores were close approximations to multiple regression 
weights based directly on empirical data" (p. 99). (This weighting system sug- 
gests evolutionary progress.) The classification battery for September, 194. 
included predictor scores for a total of seven air crew specialties: bombardier, 
navigator, bomb pilot, aerial gunner, air mechanic-gunner and radio operator- 
gunner, A final war-time revision of the battery was made in June of 1945, to 
segregate B-29 gunners from other types of gunnery trainees. The reviewer 
believes the following quotation suggests that the 1945 battery has atavistic 
traits: “The weightings for this specialty were based upon job analyses and 
one test, the Pedestal Sight Manipulation Test, with high face validity for gun- 
nery, was introduced into the battery, carrying 20 per cent of the weight for 
the specialty" (p. 103). The systems of weighting for differential prediction 
were, for the most part, quite arbitrary. It is here that a new look at experi- 
mental design can perhaps be of greatest value. 


Chapter four presents results of validity studies concerned with rela- 
tion of total weighted scores, called stanines, and non-test data such as 
age and education to training criteria. The training criteria against 
which these variates were correlated were pass-fail criteria. The fail 
criteria were eliminations from training for flying deficiency, for fcar, 
and elimination at trainee's own request. The weighted predictor scores 
when correlated against such pass-fail criteria for many classes com- 
prising thousands of trainees yielded biserial correlations for most sam- 
ples of pilots that were fairly high and nearly always significant; those 
for navigators were moderately high and frequently significant; those 
for bombardiers were low and frequently not significant. Even moder- 
ately low biserial correlations yield striking predictive efficiency when 
the results are set up in charts, as is done often in this chapter. 

_ One of the crucial indices of effectiveness of a differential classifica" 
tion battery (as contrasted to one designed for mere selection) woulc 
seem to be validity of a battery weighted for one specialty for another 
specialty. For example, how well does the pilot battery work for bom- 
bardier, the navigator battery for pilot, the bombardier battery for 
navigator, etc. Judging from the results presented the answer appear? 
to be that the battery has higher validity for the specialty for which it 
was designed. However the critical observer may ask for: (1) an experr 
mental design specifically aimed at evidence for or against the efficacy i 
a battery in differential prediction as contrasted to selection; and (2) 
safeguards against artificial contamination of the criterion. Did the 
trainee know his stanine? Did the instructor know the trainee's SU 
nine? If so, did such knowledge influence the criteria of elimination fer 


they 


2 Validities of printed tests and of apparatus tests are not included in this book: ith 
WwW 


being given in Reports No. 4 and No. 5; validities of weighted tests when correlated 
combat criteria are also presented in Report No. 17. 
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flying deficiency, for fear, or for elimination at the trainee's own re- 
quest ? 

One table of information is presented in chapter four that is particu- 
larly damaging to the belief that stanines have considerable power in 
differential prediction as contrasted to mere selection. This information 
pertains to the validity of various stanines in predicting scores of 
trainees in pistol and carbine fire! For 350 cases the various air crew 
stanines were found to correlate from .25 to .40 when correlated with 
performance on the ground with the pistol, and from .21 to .31 with 
target performance with the carbine. The author refers to these results 
as evidence of ‘the versatility of the stanines” (p. 179). 

Based on statistical and clinical approaches, Ch. 5 tells what hap- 
pened to 1,311 air crew trainees who had been given tests but who were 
not screened by tests. For this experimental group the pilot stanine cor- 
related .66 with the criterion. For the preliminary qualifying examina- 
tion “only 45 out of 520 men who failed the examination were graduated 
from training, whereas the yield among those passing the examination 
was 211 out of 751" (p. 201). The study reported in this chapter is one 
of the most interesting to come out of wartime personnel research. 

Rocer M. BELLOWS. 


Wayne University. 


THORNDIKE, Rogert L. (Ep.) Research problems and techniques. AAF 
Aviation Psychology Program Research Reports, Report No. 3. 
Washington, D. C.: U. S. Government Printing Office, 1947. Pp. 
viii4- 163. $1.00. 


A research program usually has two outcomes: (1) increased knowl- 
edge in the area being investigated and (2) increased understanding of 
the research methods used. It would have been a shame if the reporting 
of the large-scale AAF Aviation Psychology Program had heen re- 
stricted to the first of these two outcomes. Fortunately, Report No. 3 
was included in the series so that the psychological profession could 
share in the distillation of research experiences provided by that pro- 
gram. . X 
Using the relatively unified aircrew selection and classification pro- 
gram as a frame of reference, Thorndike describes the practical research 
Problems which arose and the adequacy of the methods of investigation 
applied. The report is organized around the major sione in T — 
tion program, separate chapters being devoted to job ana pent an . 
velopment, criterion measurement, determination of test : iabi n an 
validity, and problems of combining and weighting tests when used in a 
battery, The influence upon test results of variables such as apparatus 
model, examiner, time of day, etc., is also discussed. After pointing out 
the problems common to both classification and training research, 
Thorndike adds a brief chapter on problems and techniques peculiar to 
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the latter, such as administrative difficulties, equating of groups, con- 
trol of training variables, etc. . 

If Thorndike's style of writing had a lower Flesch count one might 
classify this report as a "How To" manual. It is this, to the extent that 
it provides an excellent guide to the step-by-step procedures required in 
the validation of an aptitude testing program. The sources of informa- 
tion concerning job requirements are presented, methods for test de- 
velopment are described, types of criteria are compared, and statistical 
methods and computational routines are presented in some detail. As 
such, it will serve as a uscful reference in personnel psychology. 

To consider this report as merely a manual of procedures, however, 
would be a grave injustice. For it performs the much more valuable 
function of taking the reader "behind the scenes" in order to see the 
Why as well as the Mow. Procedures are evaluated as well as described, 
limitations as well as strengths are pointed out, and theoretical implica- 
tions as well as practical considerations are presented. Some of the 
topics of particular interest to this reviewer were the following: (1) the 
value of job analysis versus trait analysis, (2) objective versus subjec- 
tive criteria, (3) selection versus classification, (4) evaluation of various 
measures of reliability in terms of the sources of variance affecting cach 
of the measures, (5) problems of restriction of range of ability duc to 
prior selection. These and other basic problems are treated systematt- 
cally and the rationale as well as the methods of attack are described. 

The editor recognizes (and clearly states in his Preface) one limita- 
tion of the report, i.e., its restriction in large part to the selection and 
classification program. Another limitation, in the reviewer's opinion, 
was the failure to discuss in detail the study of a non-preselected "ex- 
perimental group” given aircrew training regardless of their test scores. 
Although this project is described in Report No. 2 of this series, 1t 
warranted more than the brief reference in this report in connection with 
the problem of restriction of range. From a methodological point O 
view, it represented a major step in providing crucial evidence on the 
validity of the classification Program, and a discussion of its rationale 
procedures, and general implications for experimental design would ha Ve 
been appropriate. 


ALBERT S. THOMPSON: 
Teachers College, Columbia University, 


MELTON, ARTHUR W. (Ep.) Apparatus tests. AAF Aviation Psychology 
Program Research Reports, Report No. 4. Washington, D. C.: U. > 
Government Printing Office, 1947. Pp. xxi+1056. $3.50. 


. A en 
This report describes the largest program of psychomotor ac m 

research yet undertaken. Although the National Research Council se 

conducted psychomotor selection projects on civilian flyers, little ed 
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ment or information was available and many psychologists believed 
that it was not feasible to administer individual apparatus tests to large 
numbers of candidates. 

The tests developed emphasized measures of precision at high speed 
rather than steadiness, equilibrium, or strength. The standard classifi- 
cation battery at various times included: (1) complex coordination 
(Mashburn type); (2) two-hand coordination; (3) two-hand pursuit; 
(4) discrimination reaction time; (5) finger dexterity; (6) arm, hand 
steadiness and aiming; (7) pedestal sight manipulation. Complex pur- 
suit or continuous coordinations appeared to be most valid for flying 
skills. Major advances in procedure included standardized instructions, 
automatic cycling of trials, and multiple unit controls for testing four 
persons under each experimenter. Further improvements in the design 
of tests and the correlation of findings on specific tests with specific 
reasons for success or failure in flying skills would seem to call for more 
intensive empirical and theoretical job analyses. 

In addition to the development and validation of tests for the selec- 
tion of pilots, bombardiers, navigators, gunners, radar operators, etc., 
most tests were later devised to emphasize some variation in the per- 
formance such as working under distraction, stress or various degrees of 
difficulty, but with little increase in validity. Early test trials, including 
practice trials, were often as valid or even better than additional trials 
at later stages of learning. Rates of learning on various tests were quite 
Specific. 

This suggests to the reviewer that initial work methods adopted by 
a candidate in a new situation were important despite standardized in- 
Structions. The greater validity of more complex tests suggests as a 
second explanation that ability to attend simultaneously to various 
aspects of a complex situation might. have been more important than 
the motor aspects of performance. | Tests in which ipei ee Rus 
usually quite separate from the main operation would not e me e- 
quate test of this theory. The success of work sample tests involving 
coordinated movements of hands and feet in a central task would be 
more relevant. Unfortunately, the most representative work sample 
tests were not employed for classification purposes until relatively late 
When there was no longer an opportunity for large scale validation, but 


results were promising. " 
na redu which frequently "washed out" 35 per cent of those 


Who had already passed medical examinations, intelligence e and 
Several years of college work and in which the average cost of. "s 
training was on the order of $25,000 per man, any reduction in Xv 
Outs" is important, particularly when psychomotor tests cost o the 
order of $5.00 per person. Bar graphs showing the percentage o = 
Sons failing the primary training from each range of a PR 
clearly the savings from the use of single psychomotor tests. Multiple 
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correlations show that the battery of tests gave still greater validity. 
The saving of lives and equipment by elimination of trainees of lower 
aptitudes and the selection of abler trainees for actual combat duty are 
obvious advantages of a more strictly military naturc. 

To the reviewer this program has important implications as to needs 
in future research and training in psychology, e.g.: (1) The training of 
additional specialized personnel for instrument design and constructions 
(2) likewise, the relative neglect of intensive job anal ysis and notion stud y 
in psychomotor performances on military equipment calls for increased 
attention to this phase of training for future psychologists; (3) the fur- 
ther development of more complete factorial batteries of psychomotor 
tests would enable applied psychologists to select tests having the great- 
est probable validities for various skills; (4) the development of fiios 
cedures for setting up specific criteria for success in acquiring cach majo! 
type of skill; (5) the rapid mechanization of military aircraft and equip- 
ment, e.g., remote control aircraft and missiles, may outmode a great 
deal of present testing and require different aptitudes; (6) a standby 
research and selection program, revised periodically, is necessary to pro 
vide an adequate plan for emergencies. This in turn is likely to require 
an administrative plan which will encourage the widespread part icipation 
in research of civilian as well as military psychologists. 

ROBERT H. SEASHORE. 


Northwestern University. 


GUILFORD, J. P. (Ep.) Printed classification tests. AAF Aviation Psy- 
chology Program Research Reports, Report No. 5. Washington, 
D. C.: U. S. Government Printing Office, 1947. Pp. xi+919.' $3.00. 


Printed Classification Tests gives a detailed account of methods, 
results, and conclusions concerning paper and pencil tests used anc 
considered for use in the selection of Army Air Force pilots, navigators: 
and bombardiers. The volume will be of interest not only to aviation 
psychologists but to all psychometricians because of the important 
factor analyses reported. It presents impressive evidence of the cf- 
fectiveness of the organization, direction and continuity of purpose 9 
the group of psychologists who worked in the AAF research program 

The methods followed in the development of the AAF tests were 1™ 
the approved text-book style of test development that is almost never 
pursued so thoroughly in actuality: job analyses were made; hypothese 
were clearly formulated for factors that would appear to be needed; tests 
were given before the groups were selected; factor analyses were made 0 
tests and criteria; new tests were based on the validity and factor stud- 
ies; the process was repeated. This is the first successful attempt to use 


, A e 
! This review has been read and constructive criticisms provided by Mr. Georg 
Burgess and Miss Dorothy Everett. 
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factor analysis in a selection problem to show the factorial composition 
of the criterion as well as of the tests. The criterion unfortunately had 
to be only training school records rather than combat success. Factors 
extracted by the centroid method were rotated orthogonally to an ap- 
proximate simple structure. 

The tests tried were the commercial tests that looked most promising 
plus many new ones that were developed. Tests relating to these arcas 
were examined: intellect and information; verbal ability; mathematics; 
reasoning; judgment, foresight and planning; integration; memory; 
visualization; mechanical; information; perception; size and distance 
estimation; spatial; orientation; set and attention; temperament and 
personality. In addition to tests the following kind of variables are cor- 
related with training records: clinical type procedures, motivation, and 
biographical data. 

Twenty-seven separate factors, rarely the ones expected, were identified 
with varving degrees of clarity. Seventeen factors have positive validity for 
predicting success in pilot training pass-fail criterion. These seventeen factors 
are in order of their validity for the pilot criterion: (1) Spatial Relation, (2) 
Mechanical Experience, (3) Pilot Interest, (4) Integration I, (5) Visualization, 
(6) Psvchomotor Coordination, (7) Perceptual Speed, (8) Length Estimation, 
(9) Memory III, (10) Judgment, (11) Mathematical Background, (12) Integra- 
tion II, (13) Visual Memory, (14) Planning, (15) Paired Associates Memory, 
(16) Reasoning II, and (17) Spatial 1I. . a a 

These factors range in estimated validity for the pilot training criterion 
from .32 for the Spatial Relations factor to .05. Since so many orthogonal fac- 
a single criterion it is to be expected that validities of the 
“Furthermore the large number of cases used in these 
orted are probably more dependable 
d in industrial research studies. 


tors are found in 
factors would be low. 
studies indicates that the validities rep! 
than some of the higher validities reporte 
In his chapter on “General Conclusion" the editor romake s a 
complete classification battery should cover at least 25 age his 
probably overstates the case since the inclusion of those factors has ing 
lower validities does not contribute substantially to the validity of the 
composite. In addition, no more than 15 of the 27 € pers 
adequately enough for inclusion in a comprehensive Lp pulp vd 
tery.? The work that is reported is gigantic. prodi "à pes 
leads to explore for dozens of doctoral dissertations base ES : m 
found and suggested. Perhaps the most important wi us ae tie 
Judgment. Also of importance is that the nature o si ha om rats 
is clarified somewhat. Mechanical Experience comes ou (ie hend 
tor. Splitting what has pue uen prar ci i a Vis a 
and three Space factors needs additiona . . , 
Little adr will be found in this ie ep euge pr 
Rorschach, TAT, other clinical procedures for diagnosis, 


i inti isfact. 
? The reviewer is indebted to Dr. John Dailey for pointing out this fa 
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ity inventories (including Bernreuter, Minnesota Multiphasic, and 
Humm-Wadsworth). Clinical procedures referred to are a clinical inter- 
view, observations of behavior during psychomotor testing, observa- 
tions during a stress situation, observation during a short social situa- 
tion. None of these procedures was found to be useful. Significant at the 
one per cent level were only the Thinking Introversion score of Guil- 
ford's Inventory with a validity coefficient of —.09 (judged not to be 
promising), Cooperativeness and Agrecablencss of the Guilford-Martin 
Personnel Inventory, with validity coefficients of .14 and .12. lwo 
categories on the Humm-Wadsworth, Epileptoid and Hysteroid, gave 
coefficients higher than those in the inventories mentioned above, —.21 
and —.18, but they were significant only at the five per cent level 
since only 202 cases were used. Humm supplied case summaries which 
did not differentiate passes successfully from failures; however, his 
summaries were interpreted by the Air Forces' psychologists. Had 
combat criteria been available, it is possible that some of these person- 
ality measures would have been more useful. 

The volume has shortcomings. The mass of material has not been 
smoothly presented. Clarity is exceeded by bulk. It appears to have 
been done hastily for the magnitude of the undertaking. The index 1$ 
incomplete. Research psychometricians will not be seriously bothered 
by the shortcomings since the book will probably become a bible whose 
failings will probably come to take on sentimental value in the same 
way that philatelists treasure a rare stamp with some slight imper- 
fection. 


Tuomas W. HARRELL. 
University of Illinois. 


Davis, FREDERICK B. (En. The AAF qualifying examination. AAF 
Aviation Psychology Program Research Reports, Report No. 6 


Washington, D. C.: U. S. Government Printing Office, 1947. Pp- 
xvii4-266. $1.25. 


The AAF Qualifying Examination (originally termed the Aviation 
Cadet Qualifying Examination) was used as an initial hurdle in the 
selection of air-crew officers and (after June 1, 1944) in the selection 
of aerial gunners. The volume under review presents: a history of the 
examination (Chapter 1); the principles observed in constructing the 
examination (Chapter 2 and Appendix A); data on the validity, modifi- 
cation, and uses of the examination (Chs. 3 and 4); and studies of spe 
cific subtests considered for the examination (Chs. 5-10). Besides the 
Qualifying Examination, a few other examinations are very briefly 
reported on, viz.: the Aviation Cadet Educational Examination, the 
Flight Officer Examination, the English Expression Test, and the Vic 
tory Corps Aeronautics Aptitude Test (Chapter 11). d 

The volume is a meaty one, as evidenced by the 143 numbere 
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tables. No thoughtful reader can plow through this book in an odd 
afternoon; nor can any conscientious reviewer hope to do it justice 
within a few pages. In brief overview, what do we learn? 


1. Paper-and-pencil tests can be extremely useful in a practical program of 
selection, even for so “non-academic” a function as flying a warplane. The 
Qualifying Examination was definitely useful in selecting men who later became 
successful flying officers. (Chs. 3 and 12.) 

2. Research and experimentation led to a significantly improved battery of 
tests. (Chs. 2 and 3.) 

3. Apparatus tests can significantly 
pencil tests. (Page 39.) 

4. Within the limited scope o 
tests designed to substitute for appar 


tended function. (Ch. 10.) 
5. The theoretical possibility of using a “suppression variable" to improve 


thg validity of a battery of tests was in this study carried as far as constructing 
d administering a special test for the purpose; but unfortunately validity 
Correlations have not yet been obtained. (Ch. 5.) 

observations (about which the re- 
at the points above) may be worth 


augment the validity of the paper-and- 


f the reported experiments, paper-and-pencil 
atus tests do not appear to fulfill their in- 
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3. A capital lack, in the reviewer's judgment, is the failure to explain in 
detail whatever strengths or weaknesses the criterion may have held. (Not 
even an estimated reliability coefficient is offered.) The criterion generally 
employed is simply "graduation or elimination from elementary pilot training 
for reasons associated with proficiency in flying" (p. 2). Such a criterion seems 
extremely general or non-specific: not even the "academic" aspects of training 
are separated from the non-academic. Specific quantitative criteria would be 
more useful, not only for the validation and analysis of tests, but probably 
also as a means of stimulating better observation and evaluation by pilot-in- 
structors. Granted that practical circumstances prevented the use of anything 
but a simple pass-fail criterion, some brief acknowledgment and warning of the 
limitations of the criterion would still have been in order. 

4. Although the Qualifying Examination was validated against the pilot 
criterion, this same examination had to be passed also by prospective bombar- 
diers, navigators, and (after June 1, 1944) aerial gunners. 'The administrative 
convenience of a single screening test is obvious; on the other hand, it seems 
likely that the various groups of flying personnel have differentiable aptitudes 
(see e.g., Table 11.2 on p. 220). (This, of course, is a criticism of an administra- 
tive practice, rather than of the present research report.) 


The central practical question of this volume is: Was the AAF 
Qualifying Examination valid? The most explicit answer to this ques- 
tion is given in a study of over 1,000 "applicants for aviation-cadet 
training" who were entered into beginning pilot-training "regardless of 
their scores on the Qualifying Examination and the Air Crew Classifi- 
cation Tests" (p. 37). In this unselected sample, the validity of the 
Qualifying Examination (series AC12) was about .45-.50 (against the 
criterion of graduation from pilot training) (p. 38). 

There can be little doubt that the last series of the Qualifying Exam- 
ination (the AC14 series) was more valid than the first (the AC10 
series), Because of the almost inevitable lack of comparability between 
the figures given for the first and last form, it is not possible to estimate 
exactly how much improvement in validity was accomplished; the re- 
viewer's tentative judgment is about .20 point of correlation. This 
appears a remarkably large gain. Part of the gain may conccivably be 
due to improvement in the reliability or validity of the criterion; the 
reviewer has no information about this. i 

In conclusion, the reviewer would like to repeat what was stated at 
the outset: this book is indeed a solid, meaty work. The exigencies ° 
wartime research have, to be sure, prevented as complete and close knit 
a series of studies as one might desire. But the author has striven con? 
scientiously to present the information (both favorable and unfavorable) 
that has been gleaned. The task was immense, and the reviewer cannot 
close the book without a profound feeling of gratitude for a significant 
accomplishment. The volume makes contributions both of methods and 
results. 

HERBERT S. CONRAD. 

U. S. Office of Education. 
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Gipson, JAMES J. (Eb.) Motion picture testing and research. AAF 
Aviation Psychology Research Reports, Report No. 7. Washington, 
D. C.: U. S. Government Printing Office, 1947. Pp. xi4-267. $1.50. 


This volume is the official report of the Psychological Test Film 
Unit at Santa Ana Army Air Base, Santa Ana, Calif. Approximately 
half the report is concerned with the motion picture as a technique 
and is thus of rather specialized interest. Some of these studies are im- 
pressive in the care with which they have been carried out, and some of 
the results bearing on seating, illumination, and their like are clearly of 
use to certain readers. It is, however, the other half of the text that will 
interest most of the readers of the PsvcuoLocicAL BULLETIN. This por- 
tion of the report may be divided into three subdivisions for discussion. 

A ptitude tests. The Santa Ana Film Unit was engaged in part in the 
construction of selection and screening tests for a variety of Air Forces 
assignments. A number of ingenious motion picture tests for Flying 
Orientation, Ability to Judge Distance, Estimation of Velocity, Multiple 
Perception (dial reading) and other abilities were devised and are re- 
ported in some detail. The general reader will be struck by the very low 
correlations between these tests and other tests, between these tests 
and validating criteria, and even between these tests and themselves 
(various reliability criteria). Those who were engaged in similar 
tasks with the armed services will recognize, however, that r's of .25-.55 
may often represent genuine achievements. Nevertheless, the low level 
of predictability makes the permanent usefulness of these studies doubt- 


ful, and they must be accepted primarily as guideposts for others who 


will face such problems in the future. 0. . . 

Airplane recognition research. The investigations which Gibson s 
group carried out in this area are interesting for two reasons. First, 
they illustrate the danger of basing large scale activities on inadequate 
research. The “flash technique” of teaching aircraft recognition was 
accepted by both Army and Navy for reasons that were supported only 
by general theoretical considerations. The Santa Ana group has sub- 
jected the underlying assumptions of the method to a series of careful 
tests, only to discover that zone of those assumptions could be venned 
in practice. Second, the findings have an intrinsic interest. We now 
know that training in making shape discriminations at very high speeds 
does not in itself increase the accuracy of such judgments when longer 
times are available. We also know that frank analysis of the airplane 
gestalt into certain component parts that distinguish it from others will 
aid, at least in the early stages of learning, in making the necessary 


discrimination. , 
General perceptual findings. 
to the perception of aerial space 
important part of the book from 
chologist, Gibson and Glaser h 


The discussion of investigations related 
and distance is undoubtedly the most 
the point of view of the general psy- 
ave presented in Chapter 9 a newly 
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organized "system" of explanation for depth perception. It differs in 
three main ways from traditional systems. First, more emphasis 1s 
placed on monocular stimuli. Second, these stimuli are all treated as 
matters of gradients of retinal stimulation. Third, the treatment of 
“gradients of velocity” for a moving observer is (so far as the reviewer 
knows) entirely original. All three of these departures from classical 
textbook treatments are suggestive and warrant the use of this chapter 
as supplementary reading for students of visual perception. At least 
two other contributions deserve mention. One is Gibson's emphasis on 
position on the vertical meridian as a factor in depth localization. Ver- 
tical position (the so-called “stage effect") is an important influence 
under many conditions of viewing but is often completely omitted from 
discussions of depth. The other is the quantitative study of size con- 
stancy under natural conditions of viewing. The presence of practically 
complete constancy up to nearly half a mile when only the texture of 
the terrain gives any distance cues may force a revision of some notions 
about depth perception based only on laboratory findings and theo- 
retical considerations. 
W. C. H. PRENTICE. 
Swarthmore College. 


MiLLER, NEAL E. (Ep.) Psychological research on pilot training. AAF 
Aviation Psychology Program Research Reports, Report No. 8 
Washington, D. C.: U. S. Government Printing Office, 1947. Pp. 
xix4-488. $1.75. 


This volume summarizes rescarch on pilot training accomplished co- 
operatively by psychologists in the AAF Training Command during 
World War II. The chief objective of Psychological Research Project 
(Pilot) was to develop objective measures of flying skill. Allied studies 
included a job analysis of the primary and specialized functions of stu- 
dent pilots, subjective measures of flying skill, verbal tests of flying 
information, as well as experiments which evaluate selected training pro- 
cedures. The book consists of fifteen chapters in addition to six appen- 
dices which describe the more technical details of the data. The first 
fourteen chapters deal mainly with the methods and procedures em- 
ployed in gathering and analyzing data obtained on large samples of 
Air Corps personnel under various conditions of flight-training, such 
as primary, basic and advanced levels, in single-, two-, and multi- 
engine planes, in contact and instrument flying. An excellent over-all 
summary of the project, together with recommendations for further 
work, is to be found in the concluding chapter. 

It is scarcely surprising to find, in spite of the unusually skillful and 
imaginative approach of the investigators that progress in this new and 
complex area of psychology proved to be hard won. For the most part, 
coefficients of reliability and validity of the indices ran low, for reasons 
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of a sort already evident to most workers who have attempted psycho- 
logical research under similar "field conditions." Lack of independent 
judgments from check-riders who determined the all-important pass- 
fail criterion, differences in standards from one training school to 
another (or even within different groups of check-riders at the same 
School), halo-effects as suggested by strong intercorrelations among 
different aspects of performance—these are only some of the factors 
contributing to the variance of the data. Further difficulties inherent in: 
this area of research are emphasized by the finding that pilot perform- 
ance could be measured with high reliability on the same day even with 
different observers, but that reliability coefficients tended to drop to 
zero when the same indices of performance were evaluated on another 
day by different observers. As the authors emphasize: “The airplane is 
à complex and variable piece of equipment and the air is an unstable 
laboratory |,” : ! 

Among the important accomplishments of this unit was the devel- 
opment of an 81-unit scale for measuring skill in instrument flying, 
Which was sharpened to the point of fairly high reliability for this 
kind of work (test-retest reliability =.46). More significant still was 
?n experimental study which determined under reasonably well-con- 
trolled conditions the extent to which additional training on a low- 
Powered plane transfers to learning the technique of piloting a higher- 
Powered plane. Within the particular circumstances of the eming: 
transfer proved to be small enough to justify the rS pa in 
Seneral 10 weeks of advanced two-engine training, all on the -25, 
Was better than 15 wecks of training, 10 of which was on pp Ai ge 
two-engine planes and only 5 on the TB-25. sen «a E pen ns 
further experimental evidence, it was possible to conclu hi h students 
the Conditions of the investigation, "the type of m ^ E m advanced 
‘ceive their basic training does affect their per dier ine in providing 
training,” Beyond the unique value that such stu ien E ME 
Information critical to efficient training, they also Ml for the quise 
Casibility of employing objective measures of SER Sel 
Valuation of a large scale pilot-traming tc lesii in the report is 

© the reviewer the most important FES ie of many "testing" 
fhe repeated demonstration of the die erred a job that requires 
"Struments. even when fully exploited, for | conclusive answers. As 
thes €xperimental research in order to obtain 
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Kemp, EDWARD H., AND JOHNSON, A. PEMBERTON. (Eps.) Psychos 
logical research on bombardier training. AAF Aviation Psychology 
Program Research Reports, Report No. 9. Washington, D. ©: 
U. S. Government Printing Office, 1947. Pp. x 4-204. $1.25. 


Perhaps one's first and very general reaction to the military psy- 
chological research carried on during the past war is a certain feeling of 
pride that psychology had come of age and thus was able to make a 
unique and valuable contribution to the war effort. It is reassuring tO 
learn that many of the concepts, principles, and even equipment which 
had been developed in the laboratory could be applicd so readily to 
these military problems. To most of the publications within the Army 
Air Force series the writer reacted in that way. 

Because so much money and time of professionally trained personnel 
was expended in these projects one also expects that they would have 
done more than apply what we knew, and would have made some con- 
tribution to psychological methodology, or to general principles. Phe 
contribution of the present study is one of emphasis, an emphasis upon 
the difficulty in obtaining a reliable criterion. 

Superficially it would seem that the problem of obtaining an accept- 
able criterion for bombardier proficiency should not be dificult. The 
bombardier's military task is to drop a bomb on a particular target, and 
this task can be readily duplicated under non-combat conditions. By 
using a target with concentric rings the magnitude of error could be 
obtained either by estimations made by the pilot, a fellow bombardier, 
or by means of aerial photography. One could not quarrel with the 
validity of such a criterion, for it consisted of a fairly exact duplication 
of the combat job. 

The major difficulty with this criterion, called the circular error» lay 
in the fact that it had zero reliability and hence could give no precise 
indication of any particular bombardier's proficiency. Obviously such à 
criterion had no correlation with any potential predictive measure? 
A number of studies were conducted to determine the sources of errors 
in the criterion measure. 

In the first place the fault of the measure did not lie in the estimatio? 
of the magnitude of error for any single bomb burst. Photographic rec- 
ords could be read with accuracy and the estimates of observers on the 
magnitude of error were in agreement. A series of experiments indi- 
cated that the pilot, the degree of air turbulence, and the particular 
sight used were all factors contributing to the error. (If precision-built 
sights differ from each other, what about some of our home-made labor® 
tory equipment?) An extensive experiment which served to contro 
these sources of error was terminated before completion by v-J Day 
Unfortunately it has not been continued. 


However, the project was not so hampered by the criterion problem 


Ls 


ARMY AIR FORCES AVIATION PSYCHOLOGY PROGRAM 515 


that it was unable to make any contribution to bombardier training. 
It developed tests for selecting bombardier instructors, proficiency tests 
to measure knowledge of bombing and navigation, and was able to con- 
duet a learning study which gave information on the amount of train- 
ing necessary to reach a consistent level of performance in bombing 
accuracy. It was felt by the editors that the project indirectly influ- 
enced the bombardier training program through these and other activi- 
ties so that the quality of training was higher than it otherwise would 
have been. Their beliefs were supported by a letter of commendation 
from the Commanding General of the AAF Central Flying Training 


Command. . 
DrLos D. WICKENS. 


Ohio State University. 

Carter, Launor F. (Ep.) Psychological research on navigator training. 
AAF Aviation Psychology Program Research Reports, Report No. 
10. Washington, D. C.: U. S. Government Printing Office, 1947. 
Pp. ix--186. $1.25. 

This book describes the research, development and services carried 
out by the Psychological Research Project (Navigator) under the Army 
Air Forces Psychology Program during World War II. The following 
kinds of research problems were conducted by the Project: (1) selection 
of navigator candidates, (2) navigator training, (3) development of 
objective ground measures of navigation skill, (4) development of aerial 
measures of navigation skill, (5) learning studies, (6) studies on instruc- 
tion in navigator schools, (7) a study of the relative importance of vari- 
ables involved in air navigation, (8) evaluation of training aids, (9) 
ation instructors, and (10) studies of combat condi- 
oup therapy. This impressive 


selection of navig 
tions, post-combat adjustment, and gr up th d s 
list of psychologically substantial and militarily useful problems is an 
indication that it is possible to conduct important research in iacu 
tion with the action program of fighting a war. The navigator selection 
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well as our present college aptitude tests do in predicting college 
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As is the case with many training devices possessing high "face valid- 
ity," the G-2 Trainer proved to have no special efficacy over equivalent 
time spent in the class room. n 

The principal new contribution to method in applied psychology 18 
the use of factorial analysis and multiple correlation techniques in 
scientific job analysis. Error scores on eight and on 14 navigational 
variables were intercorrelated and two factor analyses were carried out. 
Both of these analyses identified three important factors. From these 
studies, it was possible to designate a particular operation. (estimating 
compass deviation) as the primary factor involved in navigator error. 
This finding led to the recommendation for special training in the op- 
eration and suggestions for re-design of the instrument (astro-compass) 
to minimize the possibility for error. The interrelation of personnel se- 
lection, training and instrument design, and the dependence of these ac- 
tivities upon objective, reliable and valid performance measures 1S 
beautifully illustrated by these experiments. 

The final chapter of the book is an evaluation of this two-year 
program. Although many problems remain unsolved or partially solved, 
it is concluded that the selection testing was successful and that 
the psychologist in the military setting contributed substantially to the 
solution of many problems not usually considered to be “psychological. 
Suggestions are also given for the operation of a peacetime research and 
development department of the Air Forces. 


Jonn L. KENNEDY. 
Tufts College. 


Hosgss, Nıcnoras. (Ep.) Psychological research on flexible gunnery train- 
ing. AAF Aviation Psychology Program Research Reports, Report 


No. 11. Washington, D. C.: U. S. Government Printing Office, 
1947. Pp. viii +508. $1.75. 


This volume is a review and an evaluation of the work of the Army 
Air Forces Aviation Psychology Program in the area of flexible gunnery- 
Although the title of the book implies concern with problems of train- 
ing only, the research of the flexible gunnery group embraced selectio? 
and equipment development problems as well. In fact, the research 
story told in the volume is convincing in demonstrating the "dog after 
its own tail” relation between the selection, training, and equipment 
aspects of psychological research in military or industrial work. The 
program on flexible gunnery began with a consideration of selection 
problems and the development of selection tests. The search for ade- 
quate criteria, however, against which to validate proposed tests of 
gunner selection, made it apparent that a thorough investigation © 
training procedures was necessary. Subsequent effort was directed to- 
ward the improvement of training techniques, the development ° 
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achievement tests and tests of skill in gunnery maintenance, the evalua- 
tion of training devices, and the selection of gunnery instructors. Since 
these teaching problems were not unrelated to questions of more ef- 
ficient ways to use available equipment, interest in operating methods 
developed, leading eventually to inquiries into possible improvements 
in equipment design through the modification of controls, sights, and 
other operational features. This elaboration of psychological research 
interests from selection to equipment problems had many direct paral- 
lels in the work of other research groups during the war. 

The most important systematic problem discussed by Hobbs and 
his collaborating authors is that of establishing a satisfactory criterion 
of gunnery performance. No completely satisfactory criterion was es- 
tablished during the four years of research described in the volume. 
Combat data, which one might expect to be the best possible kind of 
criterion, were of limited validity, since records for a particular gunner 
depended not on his skill alone, but on that of other persons as well, 
both friend and foe. Various synthetic firing devices designed to train 
men in the operation of gunnery equipment were of limited "apparent 
validity” and had reliabilities in the region of .75. Ground firing tests 
with moving targets had reliabilities between .50 and .70. Air firing at 
towed targets or with frangible bullets had reliabilities which were 
generally below .50. In the absence of more satisfactory criteria, there- 
fore, the motion picture record, taken with a gun camera mounted on 
the gunner’s equipment, was accepted as the best check of proficiency. 
Unfortunately, the reliability of scores obtained from the gun camera 
films was seriously affected by the type of course flown by the attacking 
plane and by other temporary conditions. But for both training and 
criterion purposes, the gun camera was not surpassed by any other 
type of scoring device. ue ta f 

A concluding evaluation of the general significance of the gunnery 


research program is provided by the editor: 


The full development . . . of a valuable performance testing pone might 
be considered important. The studies done on training devices are à ed dis- 
tinctive by virtue of their comprehensiveness, but the devices themse ves have 
Such limited use that this work may best be considered for its wartime applica- 
tions. Some of the studies on the use of sights may provide suggestions for 
further research on fundamental problems of visual perception. zw training 
Programs developed and evaluated may have long range value as i i 
of the effective application of principles which have long been ie an as 
sound. Nothing new has been added to knowledge of selection puse uren da 
adequate job simply having been done to meet an immediate nee . vs i 
Seems accurate to say that few of these specific findings of gunnery research are 


of enduring importance. W. E. KAPPAUF, JR 
s E. AUF, JR. 


Princeton University. 
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Cook, Stuart W. (Ep.) Psychological research on radar observer train- 
ing. AAF Aviation Psychology Program Research Reports, Report 
No. 12. Washington, D. C.: U. S. Government Printing Office, 


1947. Pp. iv+340. $1.75. 


Research on the selection and training of radar operators was car- 
ried out by three groups, the NDRC Project, Sc-70, NS-146, the Air- 
crew Evaluation and Research Detachment No. 1 of the Eighth Air 
Force, and the Psychological Research Project (Radar) of the ae! 
Training Command, This book is primarily a summary of the work © 
the Radar Project. 


Apart from having value as a record of wartime psychological con- 
tributions, the report will provide information for those interested m 
the development of selection tests and standardized proficiency tests, 
since it was in these two fields that the project concentrated its efforts. 

Through the sections on proficiency testing, the extreme difficulties 
of administration are emphasized. The continuous change and develop- 
ment of equipment, the differing emphases and doctrines in different 
schools, and the lack of training personnel to provide standardized ad- 
ministration of the proficiency tests and checks, particularly those car- 
ried out under the complex conditions of actual flight, all worked against 
the establishment of the desired evaluation program based upon known 
reliability and validity. 

A distinction is made throughout the report between the observable 
activities and the psychological factors involved in the radar observer $ 
task. This distinction led, at the outset, to a job description, and to 2 
parallel job analysis, the latter employing primarily factors “definitely 
or tentatively isolated in the Aviation Psychology Program" (p. 24). 

'The distinction between description and analysis is reflected in a 
classification of selection tests and, to a less extent, of proficiency tests; 
into two types: job analogy tests inspired by the job description, and 
factor tests suggested by the analysis into psychological components. 
Validation of the selection tests against proficiency measures indicate¢ 
that both types must be used to get the best estimate of future perform 
ance in the training program. The editor of the report predicts that 
tests based on psychological analysis will gradually supersede those 
based on job description. á j 

The primary aim of the project was to establish valid selection i"- 
struments. The research purpose of the proficiency testing program was 
regarded as that of providing criteria for validating the selection tests- 
In addition, performance on training bombing missions was subjecte 
to a critical analysis, with a view to developing criterion measures- As 
in other selection research projects with similar aims, where the final 
task was some performance involving the operation of aircraft, it prove 
to be much simpler to state the aims and general methods of the valida- 
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tion procedure than to complete significant studies. The insurmount- 
able variation in performance contributed by the complex factors in 
aircraft operation attenuated the reliability of the radar bombing erroc 
scores, just as it did in other attempts to obtain criteria of aerial per- 
formance. 

The complexities of validating selection tests are thoroughly dis- 
cussed in a penultimate chapter describing the results of two studies 
employing three batteries of selection tests validated against training 
course grades, against an air-crew classification "radar stanine," and, 
in part, against radar bombing error calculated from camera records 
obtained during aerial training flights. In spite of the besetting difficul- 
lies, a number of significant correlations were obtained. Since the 
results of these studies were available too late for application in the 
wartime selection program, their greatest value will lie in indicating to 
the uninitiated the practical problems which will be present in develop- 
ing and evaluating tests of thissort. — . . . 

A final chapter summarizes the principal points of the text and in 
is in which training research was planned, 


addition discusses several arci e d of organi 
including the fields of selecting and training instructors, and of organiz- 


ing | trainine curriculum. 

g the training curriculu C. S. BRIDGMAN. 
Navy Special Devices Center. 

Dany, J. T. (Ep.) Psychological research on flight engineer og Fi 
Aviation Psychology Program Research Reports, Report No. 13. 


Washington, D. C.: U. S. Government Printing Office, 1947. Pp. 


iii4-227. $1.25. nd 
When the Pearl Harbor debacle gave rise to the e um 
of replacing. heavy bombardment aircraft in m erm 2 d 
Air Forces decided, because civilian pinera i Vr ih pan 
Successfully, to fly in the replacements from " à d Ev 
9f the planes, however, crash landed at sea from u eh 1» 
Army Air Forces called in civilian companies pé pecie m 
their experience with long overseas flights, Pan as sin ak 
found it necessary to adda new crew member w p dide ia 
the flight from the standpoint of thorough knowle ge we igiene is 
af the craft. Fuel ‘consumption, for crampe, = bee beater tee 
Mendously with the power setting, and one o Se i posed 
new position flight engineer, was to compute 
Setting, ! 
gi It took Pan American tw 
Seer, but the Air Forces in 1942 
Sinr designs of the B-29 had been © 
Set ung of the war and were to bein pr i 
a training program immediately 


d one-half years to train a a en- 
1042 did not have two and rine: e 
i ince the be- 

drawing board since 
uam in 1943, so the Air Forces 


h- 
at var ious schools in the sou 
a 
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west. In 1944 some officers of the Psychological Research Unit No. 2 
were assigned the task of preliminary investigation of the requirements 
and special characteristics desirable for B-29 crew members, and in 
1945 the Psychological Research Project (Flight Engineer) was ac- 
tivated with specific directives for the selection and training of flight 
engineers. 

The present report is a history (ch. 1) and description of the work 
of this project. It is notable for its account of work completed (chs. 
4 & 5) in the face of what can only be called major difficulties. Con- 
sider, for example, the problem of establishing evaluative and selective 
criteria (ch. 2) for a new and complex position (1) when the student 
personnel varied from enlisted men with air mechanic experience to 
officer graduates of the Yale Technical Training Command Engineer $ 
Officer Candiate School with B-29 training from the Boeing Factory 
School; (2) when the curriculum was under constant revision of load, 
content, and geographical locus; (3) when there were long delays and 
sometimes failures in procurement of mock B-29 equipment; and, finally, 
(4) the fact that urgent need for flight engineers resulted in extremely 
few failures and wash-backs, thus vitiating even the simple evaluative 
aid of grading systems. It is no wonder that many of the research cri- 
teria proved to be subjective; and the authors point out the need in this 
respect of long-time research studies. An appendix to the report m- 
cludes copies of developed check lists, rating scales, and samples of items 
from the Flight Engineer Proficiency test. 

The job analysis of flight engineer (ch. 3) indicated a responsible and 
key job in the airplane although the position in the command hierarchy 
was never specifically established. The flight engineer's duties involve“ 
the preparation of the flight plan, pre-flight craft inspection, starting 
procedures, proper operation of the power plant throughout flight, keep” 
ing flight log, post-flight inspection, and writing a post-flight summary 
report for the ground maintenance crew. His flight station was in the 
forward pressurized compartment and his instrument panel containe 
36 dials, and a control stand fitted with 28 toggle-switches and 1 
levers. He had primary authority in emergencies which endangered the 
loss of the plane. 

The sixth and final chapter evaluates the research. Compromise 
between theory and practice resulted in only moderate successes, PUt 
valuables ones nevertheless for clearing the beginning ground of wha 
will certainly be important research of the future. There is a straight 
faced inclusion among the factors listed as influencing the trend of '€* 


search: cessation of hostilities. The word “temporary” can almost be 
heard. 


G.R STONE: 
University of Oklahoma. AYMOND 


a l 4 
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WickERT, FREDERICK. Psychological research on problems of redistribu- 
tion. AAF Aviation Psychology Program Research Reports, Report 
No. 14. Washington, D. C.: U. S. Government Printing Office, 1947. 
Pp. vii4-298. $1.25. 


This book is a competent and interesting account of the activities 
of certain American aviation psychologists during World War II. It 
should prove stimulating to academic psychologists and invaluable to 
applied psychologists concerned with similar or related problems in the 
armed forces and elsewhere. The volume contains eight chapters repre- 
senting major research areas in which redistribution psychologists 
worked, as follows: (1) history and personnel of the redistribution psy- 
chological program; (2) the selection of instructors in the personnel dis- 
tribution command; (3) combat leadership; (4) combat validation stud- 
ies; (5) psychological studies of anxiety reaction to combat; (6) surveys 
of experiences of returned personnel; (7) attitudes and preferences of 
returned personnel; (8) evaluation of the redistribution psychological 
program. 

The discussions and conclusions are generally 
careful attention to the statistical significance of data based on samples 
Whose basic characteristics including size, variability, manner of selec- 
tion, and other important relevant factors are specified. Ample tabular 
Material presented and referred to in readily apprehended fashion con- 
tribute greatly to the content and readability of the volume. Brief 
comments on the several types of studies will now be made. 
ate the further utilization of officer pilots, 
bombardiers and navigators and enlisted gunners who had been returned 
from combat duty, the development of test batteries was undertaken. They 
Were designed to indicate general and special abilities, preferences, and other 
Personal characteristics thought to be important for determining assignment 


to duty as: (1) instructors; (2) operating ner m deed ele 
terior; (3) comb: overseas; (4) administrative ar a t 
35 «guber peraninel iid tteries was confined to the selection 


Personnel. Cor i d use of such ba 

$ . Completion and use o i OR cse s A 

of instructors, Satisfactory validation against objective M ait i ied 
Proficiency was not accomplished during the period in which research v possi- 


le. NE M + oad in the volume being reviewed. 
Preliminary validation is not discusd in descriptive, an ecdotal delinea- 


tion meat leadership. Early studies prov! 1 bat leaders which was 
of behavior of successful and unsuccessful air Com d to check the initial 
h to Prepare rating scales. The latter were then employ spen romotions and 
env ations. Study of the interrelationships P Pee ra all cases several 
€ test and biographical data was à third appro haracteristics were used 
Pendent samples of adequate size and appropriate c i merged) as an indi- 
ross-check findings. ‘The successful combat leader » both as a flyer and 
SE who exhibits outstanding proficiency in his pases SEn in his job 
Perfo o tical expert, shows consideration for his men, 
ance, is sincere and self-sacrificing-" 


distinguished by 


Selection of instructors. To facilit 


9c 
Vi 
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Combat validation studies. Due to the unavailability of uscful a, 
combat performance, a data blank was developed for recording informa " 
obtained from the returnees themselves. Included among the items W ere mis 
sions flown, promotions while overseas, aircraft destroyed, highest decoration 
awarded, precombat flying accidents, flying accidents in combat, position flow B 
and average overseas efficiency rating. The limitations on the use ol these € n 
teria are discussed as are various controls applied as a result of auper e unie 
in previous studies within the Aviation Psychology Program of the ae 
between aircrew-candidate test scores and combat criteria. Eley en s ie 
covering two air forces, three test batteries and the positions ol bombare um 
navigator, first pilot, copilot and fighter pilot were studied. The N's range 7 
from 85 to 1125. “To determine the degree to which stanines and test st ns 
were associated with performance in combat as judged by the several P 
each of the samples was successively dichotomized on the basis of each per i 
nent criterion, and the means of each of the stanines and test scores for the d 
and low criterion groups were determined. The significance of each of the ¢ 
ferences between the corresponding means of the two groups was iny pies ae 
by use of Fisher's / test." None of the differences reported was cet 
significant, although it is stated that " ... . some of the results suggest 
existence of significant relationships. . . . "' . —: 

Psychological studies of anxiety reaction to combat. “Operational fatig A 
or "anxiety reaction" characterized by tension, tachycardia, sweating, Mi 
somnia, nightmares, loss of appetite, irritability, inability to concentrate yw 
low motivation was found by psychiatrists to occur in 5 to 10 per cent of Is : 
ing officers and 15 to 20 per cent of flying enlisted personnel. No relations 
between the aircrew candidate tests or the officer-instructor-selection tests, ym 
anxiety reactions to combat could be found. It may be of interest that ms 
Aiming Stress Test or Steadiness under Pressure, designed specifically to meas : 
ure resistance to verbally induced emotional responses during a psychomo 
performance, had no value for predicting anxiety reaction. The questionna d 
known as the Personal Inventory consistently showed a biserial gorrelit 2n i. 
the order of .50 with psychiatric diagnosis. It should be clearly understood t ne 
this does not represent prediction of future behavior since the scores 0n B x 
Personal Inventory were obtained at approximately the same time the Pe 
chiatric diagnosis was made though the two results are independent. The ce 
vice is merely an aid to psychiatric screening. . ms 

Surveys of experiences of returned personnel. This chapter contains 2 Ire 
body of detailed data representing the experiences and impressions of pid 
thousand men in respect to fear and courage in aerial combat, causes of mls. 
failures, and disorientation-while-flying. Limitations of space preclude TE 
attempt to summarize. The findings contained hercin should prove of cons! 
erable value to those responsible for training and instrument design. "T 

Attitudes and preferences of returned personnel. As in the case of the LM 
ceding chapter, this one does not lend itself to ready summarization. its 
finding is of particular interest. "How the man reacts psychologically to 9 iā 
bat determines his attitude (toward further combat service). The nimi : 
count of hours, missions, and the like, does not." And yet, curiously eno" E 
as stated in the previous chapter, “The hope of returning home t 
complishing a predetermined number of missions was claimed as an a defi- 
factor in the control of fear. Seventy-five percent favored rotation alte 
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nite number of missions, against 17 percent in favor of having each man fly until 
2s he began to show signs of combat strain." 
. Dowarp E. BAIER. 


Personnel Research Section, A.G.O. 


Bijou, Stpney W. (Ep.) The psychological program in AAF con- 
valescent hospitals. AAF Aviation Psychology Program Research 
Reports, Report No. 15. Washington, D. C.: U. S. Government 
Printing Office, 1947. Pp. viii4-256. $1.25. 

In September, 1944 the Army Air Forces Convalescent Hospitals 
were transferred to the Personnel Distribution Command and a Con- 
valescent Branch of the Psychological Division of that Command was 
established with Captain Sidney W. Bijou as Officer-in-Charge. Psy- 
chologists in the Convalescent Hospitals had five principal functions: 
convalescent orientation, initial evaluation, progress evaluation, coun- 
seling, and research on the psychological problems associated with 
convalescent activities. At the peak of the program there were fifty 
officers and eighty-five enlisted men assigned to the psychological serv- 
ices branches of the twelve hospitale. Approximately one hundred and 
fifty thousand individual services were rendered to approximately 
thirty thousand convalescent patients. Almost half of the convalescent 
patients were classified as neuro-psychiatric, with anxiety reaction as the 
most common diagnosis. Typical patients were hospitalized for six to 
eight weeks exclusive of convalescent leaves and furloughs. . 

While the psychological programs in the various hospitals differed 
to some extent, psychologists normally participated first by undertaking 
a major share of the orientation discussion prior to each patient's 
transfer from the admitting ward to the convalescent hospital proper. 
Next, the psychologist conducted an initial evaluation of the patient on 
the basis of available records, inventories, tests, a case history interview, 
and a mental status interview. The psychologist then made ra 
mendations to the personal physician, the most important of "^ ddi 
to do with the patient's daily five-hour program of activities. i pa idi- 
tion to the services rendered to all incoming patients, psyc : icem 
Were frequently asked to make special evaluations either to supp em 
the services of a psychiatrist or to provide personal, vocational, an 
educational guidance. 

_ Psychological research in t 
oriented toward producing instru 

Ome of the instruments, such as t 


Dventory, the Wechsler-Bellevue Scale a : ; 
Were adapted from civilian forms. Others, such as attitude scales specific 


the military situation, a biographical inventory and a test entitled 


ficiency of Mental Application, were constructed on the pm "— 
Studies of validity were almost wholly dependent upon p y 


t hospitals was entirely 
ments for more effective services. 
he Rorschach, the Shipley Personal 
nd the Cornell Selectee Index, 


he convalescen 
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rather than behavioral criteria and reflect the well-known inadequacies 
of such criteria. Validity coefficients for the inventory type of in- 
strument under these conditions ran quite high, e.g., a bi-serial z of .67 
for the revised Shipley Personal Inventory. aM 
Two exploratory studies of biographical information indicated phe 
feasibility of using such items in the statistical prediction of a psy" 
chiatrist's estimate of a patient's prognosis. The authors are carcful to 
point out, however, that since the psychiatrists used some of the same 
biographical information, the criterion is, from one point of view, con- 
taminated. "M 
The Rorschach Test and the Bender-Gestalt Test were studied in 
relation to the available criteria without conclusive results. This checks 
with other military studies of this type of instrument. It must be cither 
that Dr. Rorschach's ink spots cast a peculiar magical spell over a 
psychological mind or that our system of statistical logic urgently needs 
some extensions. The Incomplete Sentences Test, by way of contrast, 
demonstrated substantial validity for differentiating groups of pena 
judged by psychologists to be psychologically unfit for duty pedi is 
although its correlation with the psychiatric criterion was considerably 
smaller (r —.39 and .41). é 
A method for quantifying the inter-personal behavior observed in 
group counseling was developed. Individual statements by patients ane 
the group leader were classified under such headings as “displacement, 
"projection," "abreaction," "accepting," "reflecting," etc. Progress 
during a series of sessions was plotted with respect to the frequencies 9 
the different types of responses. Š 
The appendix contains the formats of several special instrument 
developed for use in the program. 


n u 


GronGE A. KELLY. 
Ohio State University. 


CRAWFORD, M. P. (ED.) Psychological research on operational training m: 
the Continental Air Forces. AAF Aviation Psychology Program cp- 
search Reports, Report No. 16. Washington, D. €: U. S. Gover? 
ment Printing Office, 1947. Pp. vii+367. $1.50. 


The avowed purpose of this volume is to “present a record of po 
research activities of aviation psychologists in the Continental Al 
Forces" ;and " .... . to provide a summary of psychological researc da 
operational training that will be of value to future research workers s 
this field." The first of these objectives has been accomplished in z 
admirable way. The style of the report is concise and clear. The a 
ganization is good, and there is little variation in general level of irea 
ment from chapter to chapter—a fact which speaks well for the C? 
with which the editing has been done. 


———— G - 
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Following an introductory chapter which treats the history, operat- 
ing procedures, and organization of the Continental Air Forces as these 
were related to the mission of the research psychologists, there are nine 
chapters on individual air crew positions. The chapter headings are as 
follows: Fighter Pilot Studies, Photo-Reconnaissance Pilot, Airplane 
Commander, Copilot, Navigators, Bombardier, Flight Engineer, Radar 
Observer, and Flexible Gunnery. In each case, the results of studies 
designed to determine the relative validities of all classification tests for 
predicting the success in operational training of the various members 
of the aircrew are reported and evaluated. j 

The next three chapters concern the selection and evaluation of lead 
Crews, studies in the acquisition and retention of aircrew skill, and the 
results of a questionnaire which was designed to determine the attitude 
" toward further duty of aircrew personnel returned from combat. The 
final chapter is a summary and evaluation. In many ways this is the 
most valuable chapter in the book, for it is here that the editors present 
a critical evaluation of the work as a whole. 

"This reviewer does not intend to be captious when he states that the 
methods employed, the general type of result obtained, and the diff- 
Culties which were encountered are familiar to nearly all peser 
Who were engaged in research on military problems. mi p^r the 
Second objective of this volume was accomplished depended in b. 
Part on forces over which the authors had no control. If military € 
Niques, training methods and organization change Dam adam bond. 
cally, the data reported will be of almost no value. On t ‘th vhat has 
the book does serve to acquaint research psychologists cs m E "4 
already been achieved and need not be repeated. For this purpose, 


has been well written. Cart H. WEDELL. 


University of Wisconsin. 


LrerLey, W. M. (Ep.) Psychological research in the gA xd i 
Aviation Psychology Program Research Mem ffs 1947. Pp. 
Vashington, D. C: U. S. Government Printing 
E MM d in the research 
A total of 16 officers and 31 enlisted men i.e renont is oon 
Wities of the five principal projects with - m E first-hand in- 
Ned. The primary missions were two-fold: to rerum as Umi 
<r Mation in the various theaters of war that bor: lection and clas- 
sin trio data against which to validate the M orsus for the various 
Ln tests; and to collect data on the require 


act 
Cer 


Sk x 
© Of airmen, d with the development Es 
Seconda; = rere concerne : titude 
Prof: darily, the missions wer the evolution of ap 


lency tests for specialized personnel; 
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tests not adequately covered by the original test battery, and such 
special problems as pilot fatigue in connection with long missions. -— 

As for combat effectiveness, the criterion measures were selecte 
from four categories: 


1. Objective measures of combat proficiency, such as accidents, casualties 


and number of enemy planes shot down. : 

2. Records of administrative actions taken in combat units (e.g., transter 
of inefficient personnel or their reassignment). 2 

3. Ratings based on direct and systematic observation of combat effective- 
ness in specific missions or analysis of records thereof. 

4. Ratings b: 


ased upon general impressions, reports and incidental observa- 
tions. 


The investigators frankly state the various attenuating and biasing 
factors that affected the objectivity of their criteria. Included are halo 
effects in ratings, shrinkage of original populations, goodness of oppor- 
tunity in determining kill scores of fighter pilots, atmospheric condi- 
tions, enemy opposition and deficiencies in record making and keeping, 
as illustrated by the heterogeneity (both inter-unit and intra-unit) of 
test batteries employed, scoring formulae applied, the time limits set 
and modes of recording scores of the sclection and classification tests. 
Other inconveniences and frustrations of a pioneer attempt to fit the 
best man into each particular job under combat conditions are objec- 
tively described. 

The report states that “the predictors used in the classification pro- 
gram were, in general, in some degree effective in the selection for 
combat" (p. 194). Of the 27 predictors for the bombardier category, 18 
(66.7%) were predominantly positive. The 37 navigator predictors 
showed 16 or 43.2% that were positive for the most part. The fighter 
pilot category indicated 25 (58.1%) of the 43 items as positive as com- 
pared with 27 (61.495) of the 44 predictor items in the bomber pilot 
category. : 

The fundamental inadequacy of selection and classification in- 
struments employed is said to reside in their failure to predict the more 
complex and situational personality characteristics such as "'emotiona 
control or stability, judgment, maturity, leadership and motivation 


(p. 194)—traits that “experts” judge high among the requirements for 
successful combat air crew personnel. 


N. H. Pronko- 

University of Wichita. 

DEEMER, WALTER L. (Ep.) 
AAF Aviation Psychology 
18. Washington, D. C.: U 
Pp. vii-+621. $2.25. 


4 " e 
This volume is probably the least interesting reading and at the San 
time the most important of the entire series of AAF Aviation Psycho 


Records, analysis, and test pra 
Program Research Reports, Report AT. 
- S. Government Printing Office, d 


wt 
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ogy Research Reports. It is composed of twenty-three chapters divided 
into three parts and five appendixes. Part ] entitled “Test Procedures," 
"ss has a threefold purpose: first to givea brief account of the trial- 
and-error procedures that characterized early air-crew classification in 
the AAF Training Command and led to the adoption of standardized 
testing procedures; second, to present these standardized procedures in 
detail as an historical record of how standardization Was maintained in 
air-crew classification in World War II; and third, to provide a basis for 
post-war testing procedures.” 

Part II, devoted to “Records Procedures,” contains an excellent 
detailed historical description of the record keeping systems employed 
and should be useful to anyone desiring to use the records for research 
purposes in the future. One chapter (XIX) gives a detailed description 
of the procedures for “Obtaining Sums of Squares and Products with the 
IBM Alphabetical Accounting Machine," 

Part II, “Analysis Procedures,” is devoted mainly to enumerations 
and derivations of formulae for statistical analysis of data. 

The five appendixes are reprints of various manuals for test admin- 
istration, classification, and processing of candidates for air-crew. The 
volume is profusely illustrated with reproductions of record forms, 
punch cards, and pictures. This volume will be of inestimable value to 
researchers of the future who may wish to make special studies using 
data already available because of the minutely detailed descriptions of 
procedures and methods of record keeping. 

E. R. Henry, 

Richardson, Bellows, Henry & Co. 


Fitts, P. M. (Ep.) Psychological research on equipment design. AAF 
Aviation Psychology Program Research Reports, Report No. 19, 
Washington, D. C.: U. S. Government Printing Office, 1947. Pp, 
xii 4-276. $1.25. 

The purpose of this volume is to review the research accomplished 
on equipment design and related problems of engineering psychology 
by the Army Air Forces. The work reported on was performed prior to 1 
October 1946 at the Department of Psychology, AAF School of Avia- 
tion Medicine, and the Psychology Branch, Aero-Medical Laboratory, 
Engineering Division, Wright Field. 

There are 21 chapters all together, 17 of which are reviews of sepa- 
Tate research projects written by the individuals who originally oe 
Out the investigations. The first chapter outlines and oe the 
field of engineering psychology, and presents the m e 
methods and techniques and research objectives of t ks ee o; 
Psychology. The second chapter is a survey of sensory disp ay pro pns 
in the design of aviation equipment. Chapter 3is a systematic ana = 
9f human motor abilities related to equipment design and a suggeste 
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program of research. The chapters which follow are detailed reports of 
particular investigations in one or both of these two arcas. . 

Although the chapter reports which constitute the major part of this 
volume are oriented in terms of the practical problems of equipment 
design, they contribute, in many instances, to the general body of 
knowledge in experimental and applied. psychology. These research 
reports may be classified under four different areas as follows: 


1. Visual display problems. Specific projects are reviewed on the design of 
clock dials for reading in military time, a comparison of tables versus graphs as 
a means of presenting numerical data, the design of navigation plotters, the ef- 
fects of dial diameter and angular separation of scale divisions on dial reading, 
and a comparison of cartesian-grid versus polar-grid presentation as they af- 
fect the interpretability of radar scopes. 

2. Tactual-kinesthetic display problems. Studies are reported on the ability 
of pilots to reproduce pressures with the hands or feet, the tactual discrimination 


of knob shapes, shape coding of control knobs, and the location discrimination 
ability in the absence of visual cues. 

3. Sensori-molor interrelations. "There are reports on the control of aircraft 
roll as affected by different attitude indicators, the accuracy of sighting and 
triggering with different types of gunsight handgrip controls, two projects on 
the relation between control motions and the movements of visual indicators, 
the principles of control arrangement for sequential operations, and a compari- 
son of different types of control movements in a compensatory pursuit task. 

4. Environmental effects on visual factors. For example, efforts were made 


to study the effects of anoxia on visual illusions and the influence of positive £ 
on the reading of aircraft instrument dials. 


Although this book portrays the manner in which experimental 
psychologists can apply themselves and their methods to military 
problems, it is apparent that this type of engineering psychology has 
many civilian and industrial applications. The demonstration that à 
small change in an indicator or control system can produce a significant 
improvement in the efficiency of a machine operator is sure to appeal to 
industrialists. Aside from the practical applications, several novel an 
important contributions have been made to systematic psychology 
particularly the results of studies on kinesthetic and tactual discrimina- 
tion and the analysis of man’s motor abilities. ‘The association © 
psychologists with engineers has produced some remarkably good ex- 
perimental apparatus. The manner in which practical military prob- 
lems have yielded themselves to standard experimental methodology 
and statistical treatment is a tribute to the ingenuity of the investiga- 
tors. Inasmuch as each experiment was originally an attack on a specific 
practical problem, there is occasional difficulty in introducing a partic- 
ular chapter. On the whole, however, the editor has produced satisfac- 
tory continuity and an integrated picture of a new trend in psychology: 

LroNAnD C. MEAD. 

Tufts College. 


St 


$F. ————————— 


BOOK REVIEWS 


HinLGagp, Ernest R. Theories of learning. New York: Appleton- 

Century-Crofts, 1948. Pp. viii+-409. $3.75. 

This is a "middle" book. It has the advantages and disadvantages 
of being in between. 

It is half way between the undergraduate and graduate level. Thus 
it can be used profitably at either level, but must be supplemented at 
each. Although the writing is clear and simple, only the most advanced 
undergraduates will have enough background to achieve good under- 
standing without carefully planned supplementary readings and lec- 
tures. At the same time, most teachers at the graduate level will want 
to go beyond this book with assignments from original materials and 
more detailed discussions. At both levels, the suggested readings at the 
end of each chapter will be helpful. 

This book is also somewhere between a completely impartial survey 
and a polemic treatise that sets out to make a case against stimulus- 
response theories and for field-cognitive ones. The reviewer, who hereby 
warns the reader of his preference for stimulus-response concepts, will 
have more to say later about the direct and indirect ways in which the 
author's opinions influence his book. 

Finally, it is a middle-sized book. The topics covered and the pages 
devoted to each are as follows: Introduction, 18; Thorndike, 32; Guthrie, 
24; Hull, 40; Skinner, 30; Current Functionalism, 31; Gestalt, 32; 
Lewin, 25; Wheeler, 27; Tolman, 33; Theories Influenced by Field 
Conceptions, 31; A Point of View, 37; and References, 34. 

In digesting the material in his extensive list of almost 600 references 
and condensing it to a mere 360 pages of text, the author has performed 
a difficult task which makes his well-indexed book E convenient and 
valuable source of titles. On the other hand, a truly rigorous analysis of 
the logic of learning theories with a critical evaluation of experiments is 
far too detailed and lengthy a task to be covered completely in the 
relatively few pages used. Within the wide range of material he con- 
siders, the author has had to a od "€ and touch relatively 
lightly a iudges to be the high spots. 

i enlie this: necessary selection is influenced by the vend 
preference for field-cognitive concepts and reaction agamst ge ir us- 
response ones. For example, he gives appr oximately 16.5 ur e to 
Norman R. F. Maier and only 1.3 to Kenneth Spence. In apis mg 
Maier’ Its he says (p. 307): “These results are taken to be critica 
- aier's results d aer such as those of Spence (1936) which reduce 
hows i to modifications in the relative strengths of eai id 
ing tendencies." But Spence's theory and the experiments supporting 
529 
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it are not described. Similarly, with respect to the continuity -discon- 
tinuity controversy, the experiments by Lashley and Wade are de- 
scribed in considerable detail (somewhat tempered by a footnote 
pointing out that all of them have the same unbalanced feature and 
hence are uncontrolled), while those on the other side (e.g. by McCulloch 
and Pratt and by Spence) are omitted from the text. . 

Another type of selection is taking the space to elaborate field- 
cognitive types of theory beyond their author's published statements, 
showing how they might fill in gaps and deal with embarassring experi- 
mental contradictions, but not doing this for stimulus-response theories. 
For example, pains are taken to develop what Koffka might have said 
if he had been more systematic about it, while stimulus-response 15 
criticized for not handling punishment adequately; but there is no at- 
tempt to develop the relevant S-R concepts, no description of Mowrer's 
work on anxiety, and a serious misstatement (p. 62) of what Guthrie 
should predict. 


Another way the author's opinion must be expressed is in deciding 
which concepts to examine critically and which to pass over uncritically. 
Here the author's critical examination of Thorndike contrasts with his 
much less critical acceptance of Skinner. Evidence against Thorndike's 
principles is presented in some detail; such fundamental questions as 
whether discriminated operants can be shown to follow different laws 
from conditioned respondents are not raised. 

The choice of what material to include, to elaborate, to criticize is à 
legitimate matter of opinion. Attention has been directed to this only 
because its influence, though no less real, is much less obvious than that 
of direct argument. In fact, the introductory student has no way of 
detecting a bias of this kind. 

The author's opinion also shows up occasionally in the choice of 
emotionally-toned words. For example, the hypothesis of movement- 
produced stimuli is referred to as an “ever-present alibi” while Wheeler $ 
hypothesis that learning follows the laws of embryology and physical 
growth is described as an “audacious proposal.” Some of the strongest 
opinions are expressed in informal editorial asides as in the statement 
(p. 144): “If the notion of operant behavior can be defended and ex- 
tended, it may do much to break down the rigid stimulus-response con? 
ceptions which have too long been dominant in American psychology: 

It is the reviewer's opinion that Guthrie docs not get a fair hearing. 

To cite a single example, Guthrie's criticisms of opposing theories are 
characterized (p. 73): "When he attempts to show why they are not 
helpful he resorts to superficial criticisms and just occasionally t9 
ridicule." The reviewer agrees that many of Guthrie's criticisms arè 
written in a light and amusing style, but he does not consider them 
superficial. For example, when Guthrie says, “the rat is left bise 
thought," he exposes the central weakness of Tolman's book as we 
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of many other cognitive approaches, namely, that the theory deals with 
thought but has no means of getting it over into action. This is a weak- 
ness that Tolman has subsequently struggled to correct. An adequate 
criticism of the chapter on Guthrie is completely beyond the scope of a 
review; for more details the reader can turn to a discussion by F. D. 
Sheffield. ! 

"Though Hilgard shows his disapproval of the content of stimulus- 
response theory in a variety of ways, he definitely approves the rigorous 
logical form of Hull's theory construction and believes that learning 
science must ultimately be written quantitatively very much as Hull 
writes it. Though approving their content, Hilgard criticizes the loose 
form in which field-cognitive theories are expressed. He says (p. 259), 
“In order for the teachings of gestalt psychology to be assimilated to the 
ongoing body of scientific knowledge about learning, someone needs to 
do for gestalt psychology what Hull has been doing for behaviorism, 
that is, to state the laws or postulates more clearly and precisely, and 
then to design experiments which will true up these laws for the pur- 
poses of scientific prediction in terms of the detailed circumstances 
under which learning occurs. The suggestions by the gestalt psycholo- 
gists have been sufficiently fertile that the carrying out of such a pro- 
gram of extension and validation holds rich promise.” 

It is interesting to compare this suggestion with one issued eighteen 
years before by Professor Hull: "A friendly invitation is extended to the 
Gestalt psychologists, and to such other schools as put forward distine- 
tive theories of learning, to exhibit in similar detail a similar deduction 
from their own principles." ? 

Let us hope that this invitation will not have to be repeated after 
another eighteen years. When the two theories are put into equally 
good form, it will be much easier to judge their content. 

This book is more than a superficial parade of theories. The careful 
reader will find that Hilgard has given a great deal of thought to many 
problems and will find much good common sense with which to agree. 
Hilgard makes an excellent case for the desirability of attempting to 
construct rigorous miniature systems and gives a clear description of 
the way several such systems may eventually be integrated to form a 
more comprehensive theory. He points out the usefulness ei naturalistic 
observation in providing a broad foundation for theory an Ean 
the importance of studying organization and perception, the higher 
and the socially-acquired hierarchy of human motives, 
ttention to the problem of learning capacities, the 
children as experimental subjects, and the im- 
he appropriate organism in studying neurological 


mental processes 
the need for more a 
advantages of young 
portance of selecting t : 
changes correlated with learning. 
! Hilgard's Critique of Guthrie. Psychol. Rev. (In press.) 
? Psychol. Rev., 1930, 37, p. 253. 
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His own theory of “the provisional try” is stated so sketchily and so 
modestly subsumed under "A Point of View” that it is difficult to 
evaluate. He says (p. 337), "Such an interpretation takes all the 
mystery out of the backward action of effect.” The reviewer believes 
that it substitutes some equally knotty problems. Before any predic- 
tions can be made on more than an intuitive basis (i.c. what would Tas a 
reasonable being do if I were in that situation?), the author will have to 
state the laws governing such things as how a provisional behavior route 
is kept in suspension, how it has its provisional status changed by con- 
firmation as an appropriate path of action and non-confirmation as an 
Inappropriate one, how the memorial representation of what was done 
leads to action, and what determines the specific type of action that will 
be considered appropriate. Perhaps the author will attempt to attack 
these problems and formulate his own theory in more rigorous detail in 
a subsequent book. 


NEAL E. MILLER. 
Yale University. 


WoopwontH, R. S. Contemporary schools of psychology. (Rev. Ed.) New 
York: Ronald Press, 1948. Pp. ix 4-279. $3.00. 


In this new edition, Woodworth is still Woodworth: in style, direct 
and informal, sometimes even spritely; in substance, erudite, sane, criti- 
cal but appreciative. Although no new schools have arisen since the 
first edition in 1931 (thank God, say all of us!) there have been impor- 
tant new developments in the existing schools, and these have led Wood- 
worth to a rather drastic reorganization of his book. Indeed very few 
passages seem to have survived intact; and there is much fresh material 
and a fresh appraisal. 

Both Structuralism and the Chicago variety of Functionalism, now 
moribund as distinct schools, receive reduced exposition, though Wood- 
worth calls attention to their permanently valuable contributions. But 
he now distinguishes a more broadly conceived variety of functionalism 
with roots going back to the Greeks. (Titchener said it was merely com- 
monsense, which for him meant that, though it might be valuable, it 
did not have the value of science.) This Functionalism starts ‘with the 
results accomplished by the individual and asks how they were accom- 
plished" (p. 32). This is in fact the psychology of ''Associationists, Old 
and New” including Thorndike and the successors of the Chicago Func- 
tionalists, of the later Behaviorists, and of nearly all applied psychology: 
Indeed the psychology of performance, as Woodworth elsewhere calls it 
provides a broad platform upon which virtually all psychologists today 
can stand; and although he is too modest to make such a claim, he i$ 
one of the chief architects and builders of this platform. 

Woodworth probably underestimates the historical significance of 
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early. Behaviorism and the Meyer-Watson-Weiss revolution. Eclectic 
au fond du coeur that he is, Woodworth is not the man to appreciate a 
merely destructive. force, which is just what he shows Behaviorism to 
have been. Almost nothing constructive, and at the same time distinc- 
tive, from that earlier period has survived, nothing, that is, that was 
not explicit in Functionalism. But it was the revolutionary ardor of 
Behaviorism which convicted the inheritors of Wundt of sterility; no 
introspectionist was ever quite the same after reading Watson. 

The new Associationists and the new Behaviorists have developed 
theories rather than schools—Skinner's valiant positivism possibly ex- 
cepted. Woodworth makes their place in the psychological scheme of 
things clear; but the graduate student preparing for his examinations 
must be prepared to go further. 

Gestalt, Woodworth rightly sees as a reform of Structuralism; but 
it has steadily gravitated towards that “primary Functionalism" which 
is the common denominator of the contemporary scene. Concurrently 
it is winning almost universal acceptance, if not of Gestalt as a move- 
ment, at least of its principal concepts. Woodworth is at home with 
Gestalt, and most Gestalters will recognize that his treatment is a good 
introduction. s 

The analytical psychologists are less likely to be satisfied. Wood- 
worth tries hard to extract the precious metal from the ore of psycho- 
analysis, but I think he is constitutionally unsympathetic. His chapter 
is therefore inadequate as an introduction to psychoanalysis. It is, on 
the other hand, a good statement of what the Freudian movement has 
so far contributed to psychology. 

In his time, McDougall managed to knock together the heads of 
Nearly all his opponents—that is to say, of nearly all d ipiis 
“Such blandishments," Woodworth drily remarks, ''fai b t id ag = 
But McDougall’s contributions to psychology cannot io tof ge ae 
his rs d troversy. Woodworth shows that his influence an 

manners in contro y à + young men who have been taught 
accomplishments were great; the very young 


i even the mention of Mc- 
by those who remember the twenties to scorn 


Dougall are none the less more and more making peie rel 
tion concept (McDougall spoke of purpose or horme), are mo 


: , 

accepting the notion that buc eins amha e e cried ree 
i P i social pressur pumped i 

e pumped into an organism by al p dudum bebes mt 


ipe thia sige tiated ee aiios h r adult motives are 
gin; as Woodworth points out, he does not say that ou 


i of 
instincts—that role he assigned to the antient). MT um 
the immediate future will not be McDougal wol disi din 
resemble his doctrine more than that of any ot his 


Ek n tape Frani, d as part of Gestalt, now shares a chapter 


li ly treate : : 
with Kd Again, I do not feel that Woodworth is sym 
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pathetic; but if he does not sce in these schools great values, neither 
does he see them as absurd. Instead, as always, he asks what corrective 
to our thinking is here, what do these intelligent men sce that the rest 
of us are overlooking. 

Thus the whole book is an exposition of that much misunderstood 
position, eclecticism. It is not mere syncretism, not a mere collection of 
a little that is good from this system or that. Eclecticism is itself a 
System, and the most intellectually arduous of all. In most systems 
the ego drive of the proponent is a part of the theory. The system- 
maker asks of any fact or concept, "Will it enhance my theory?" The 
eclectic asks, “Will it enrich my vision of truth?" Other systematists 
ask, "How can I get people to accept my views?" ‘The eclectic asks, 

What correction in my views must I make in the light of this evidence, 
these considerations?" The eclectic walks down “the middle of the 
road" because that is the best place from which to sce what is going 
on every side. 

This book may well be Woodworth's last testament; from it we may 
all catch a little glimpse of why Aristotle spoke of the truly great man as 
magnanimous—and of what he meant thereby. 


. Horace B. ENGLISH. 
Ohio State University. 
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